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Conversion of cell movement responses to Semaphorin-1 and Plexin-1 fro 
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L_J Tanpakushitsu Kakusan Koso. 2004 Feb;49(3 Suppl):226-7. Review. Japanese. No 
abstract available. 

PMID: 14976733 [PubMed - indexed for MEDLINE] 

0 24: Nakanisbi S, Fujisawa 1L Kaneko T, Tanji J, Qhtnori H, Shibuki K, Related Articles, Links 
Noda M, Yamamori T. 

[Present status and future aspects of studies on neuronal network formation 
(discussion)] 
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Cancer Res. 2004 Feb 1;64(3):1008-15. 

PMID: 14871832 [PubMed - indexed for MEDLINE] 



G27: Kumanogoh A. Kikutani H. 



Kelated Articles, Links 



h cb 



1—1 Biological functions and signaling of a transmembrane semaphorin, 
hg e e e fcg e ch be 



Entrez PubMed 



Page 4 of 53 



CD100/Sema4D. 

Cell Mol Life Sci. 2004 Feb;61(3):292-300. Review. 
PMID: 14770294 [PubMed - indexed for MEDLINE] 

□ 28: Tsim TY. Wong EY, Leun g MS, Wong CC. Related Articles, Links 

Expression of axon guidance molecules and their related genes during 
development and sexual differentiation of the olfactory bulb in rats. 

Neuroscience. 2004; 123 (4): 95 1-65. 

PMID: 14751288 [PubMed - indexed for MEDLINE] 

fj 29: Kury P. Abankwa D, Krase K Greiner-Petter R> Mailer HW. Related Articles, Links 

Gene expression profiling reveals multiple novel intrinsic and extrinsic 
factors associated with axonal regeneration failure. 
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PMID: 14750961 [PubMed - in process] 

J 30: Fenstermaker V. Chen Y. Ghosh A. Yuste R. Related Articles, Links 



Regulation of dendritic length and branching by semaphorin 3A. 
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Cross-talk between vascular endothelial growth factor and semaphorin-3 A 
pathway in the regulation of normal and malignant mesothelial cell 
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PMID: 14586460 [PubMed - indexed for MEDLINE] 
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for growth cone collapse. 

JNeurosci. 2003 Aug 20;23(20):7602-9. 

PMID: 12930799 [PubMed - indexed for MEDLINE] 
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Structure of the semaphorin-3 A receptor binding module. 
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CIITA-regulated plexin-Al affects T-cell-dendritic cell interactions. 
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Structure and function of the immune semaphorin CD100/SEMA4D. 
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CR, CU f CZ, DE, DK, DM, DZ f EE, ES , FI , GB , GD , GE , GH , GM , HR, 

HU, ID, IL, IN, IS, DP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, 
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author] 

CS Independent Research Group Neurodegeneration, MPI of Psychiatry, 

Kraepelinstrasse 2, 80804, Munich, Germany 

chris@mpipsykl .mpg.de 
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CODEN: JBCHA3. ISSN: 0021-9258. 
DT Article 
LA English 
OS Genbank-AF28866 
ED Entered STN: 31 Jan 2001 

Last Updated on STN: 15 Feb 2002 



L4 ANSWER 29 OF 86 BIOSIS COPYRIGHT 2004 BIOLOGICAL ABSTRACTS INC. on STN 
DUPLICATE 8 



AN 2000:261050 BIOSIS 
DN PREV200000261050 

TI The transmembrane protein ***Semaphorin*** repels 

embryonic sympathetic axons. 
AU xu, Xiao-Mei; Fisher, Daniel A.; Zhou, Lijuan; White, Fletcher A.; Ng, 

Sheldon; snider, William D. ; Luo, Yuling [Reprint author] 
CS Exelixis Pharmaceuticals, inc., 260 Littlefield Avenue, South San 

Francisco, CA, 94080, USA 
SO Journal of Neuroscience, (April 1, 2000) Vol. 20, No. 7, pp. 2638-2648 

print. 

CODEN: JNRSDS. ISSN : 0270-6474. 
DT Article 
LA English 

ED Entered STN : 21 Jun 2000 

Last Updated on STN: 5 Jan 2002 

L4 ANSWER 30 OF 86 MEDLINE on STN 

AN 1999160821 MEDLINE 
DN PubMed ID: 10049528 

TI Cloning and characterization of a novel class VI semaphorin, semaphorin Y 
AU Kikuchi K; chedotal A; Hanafusa H; ujimasa Y; de Castro f: Goodman c S- 

Kimura T ' 
CS Sumitomo Pharmaceuticals Research Center, 3-1-98, Kasugade-Naka, Konohana. 

Osaka, 554-0022, Japan. ' 
so Molecular and cellular neurosciences , (1999 Jan) 13 (1) 9-23 

Journal code: 9100095. ISSN: 1044-7431. 
CY United states 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority Journals 

OS GENBANK-AB000817; GENBANK-AB013729 ; GENBANK-AB014074 
EM 199904 

ED Entered STN: 19990426 

Last Updated on STN: 19990426 
Entered Medline: 19990415 

L4 ANSWER 31 OF 86 BIOSIS COPYRIGHT 2004 BIOLOGICAL ABSTRACTS INC. on STN 

DUPLICATE 9 
AN 1999:59333 BIOSIS 
DN PREV199900059333 

TI Semaphorins ill and IV repel hippocampal axons via two distinct receptors 
AU chedotal, Alain [Reprint author]; Del Rio, Jose A.; Ruiz, Monica; He, 
Zhigang; Borrell , Victor; De Castro, Fernando; Ezan, Frederic; Goodman, 
Corey s.; Tessier-Lavigne, Marc; sotelo, Constantino; Soriano, Eduardo 
CS inserm U106, Hop. Sal pet ri ere, 75013 Paris, France 

SO Development (Cambridge), (Nov., 1998) Vol. 125, No. 21, pp. 4313-4323 
print. 

CODEN: DEVPED. ISSN: 0950-1991. 
DT Article 
LA English 

ED Entered STN: 16 Feb 1999 

Last Updated on STN: 16 Feb 1999 

L4 ^™ E * 32 0F 86 BI0SIS COPYRIGHT 2004 BIOLOGICAL ABSTRACTS INC. on STN 
AN 1999:49302 BIOSIS 
DN PREV199900049302 

TI Transmembrane protein ***semaphorin*** *** VIa *** -j s rep ulsive for 

axons of embryonic sympathetic neurons. 
AU Xu X M. [Reprint author]; Fisher, D. A.; Zhou, L.; Ng, s. [Reprint 

author]; White, F. A.; Snider, w. D.; Luo, Y. [Reprint author] 
CS USA ™' Inc ■ , 260 Littlef " ie " | d Ave., south San Francisco, CA 94080, 

SO society for Neuroscience Abstracts, (1998) Vol. 24, No. 1-2, pp 539 
print. 

Meeting Info.: 28th Annual Meeting of the Society for Neuroscience, Part 

1. Los Angeles, California, USA. November 7-12, 1998. Society for 

Neuroscience. 

ISSN: 0190-5295. 
DT Conference; (Meeting) 

Conference; Abstract; (Meeting Abstract) 

conference; (Meeting Poster) 
LA English 

ED Entered STN : 10 Feb 1999 

Last Updated on STN: 10 Feb 1999 



L4 ANSWER 33 OF 86 CAPLUS COPYRIGHT 2004 ACS on STN DUPLICATE 10 



AN 1997:373464 CAPLUS 
DN 127:106980 

TI cloning and expression of a novel murine semaphorin with structural 

similarity to insect semaphorin I 
AU Zhou, [_.; white, F. A.; Lentz, s. I.; Wright, D. E. 

cs Center Study Nervous System injury, Dep. Neurology, Washinqton Univ., St. 

Louis, MO, 63110, USA 
SO Molecular and Cellular Neuroscience (1997), 9(1), 26-41 

CODEN: MOCNED; ISSN: 1044-7431 
PB Academic 
DT Journal 
LA English 

L4 ANSWER 34 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAG79413 Protein DGENE 

TI New human cell adhesion proteins (CADHP) useful for treating, diagnosing 
and preventing diseases or conditions associated with the aberrant CADHP 
expression e.g. cancer, acquired immunodeficiency syndrome, Alzheimer's 
disease and epilepsy - 

IN Duggan B M; xu Y; Lee E A; Lee S; Lu D A M; Warren B A; Yue H; Gietzen K 
J; Honchell c D; Burford N; Baughn M R; Tang T Y; Hillman J L; Gandhi A 
R; Kallick D A; Bandman o; Graul R C; walia N K; Lu Y; Ramkumar 3 ; Yao M 
G; Lai P G 

PA (INCY-N) INCYTE GENOMICS INC. 

PI WO 2002059312 A2 20020801 149p 

AI WO 2001-US49206 20011218 

PRAI US 2000-256542P 20001218 
US 2000-259604P 20001222 
US 2001-260101P 20010105 

DT Patent 

LA English 

OS 2002-590826 [63] 

CR N-PSDB: ABA00055 

DESC CADHP-2, incyte ID No: 7596315CD1. 

L4 ANSWER 35 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ABB11205 peptide DGENE 

TI Human proteins and DNA encoding sequences useful for preventing, treating 
or ameliorating a medical condition in a mammalian subject e.g. arthritis 
and cancer - 

IN Tang Y T; Liu C; Drmanac R T 

PA (HYSE-N) HYSEQ INC. 

PI WO 2001057188 A2 20010809 999p 
AI WO 2001-US3800 20010205 
PRAI US 2000-496914 20000203 
US 2000-560875 20000427 
DT Patent 
LA English 
OS 2001-457740 [49] 
CR N-PSDB: ABA08449 

DESC Human ***semaphorin*** *** v -j a *** homologue, SEQ id NO: 1575. 

L4 ANSWER 36 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN AAB82595 Protein DGENE 

TI Nucleic acid molecules encoding human secreted proteins, used in 
preventing, treating or ameliorating a disorder, e.g. Alzheimer's 
diseases and cancers - 

IN Ruben s M ; shi Y 

PA (HUMA-N) HUMAN GENOME SCI INC. 

PI WO 2001053343 Al 20010726 339p 

AI WO 2001-US1436 20010117 

PRAI US 2000-176307 20000118 

DT Patent 

LA English 

OS 2001-483137 [52] 

CR N-PSDB: AAH26247 

DESC Human secreted protein encoded by cDNA clone HE8QT72. 

L4 ANSWER 37 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAY71461 peptide DGENE 

TI Nucleic acid coding for human ***semaphorin*** ***s A *** _ ***]*** 
used as diagnostic agent, therapeutic agent, for modulating immune 
system, in gene therapy or for effecting differentiation, cytoskeletal 
stabilization and/or plasticity - 

IN Behl C; Klostermann A 

PA (PLAC) MAX PLANCK GES FOERDERUNG WISSENSCHAFTEN. 



PI 

AI 

PRAI 

DT 

LA 

OS 

CR 

DESC 

L4 
AN 
TI 



IN 

PA 

PI 

AI 

PRAI 

DT 

LA 

OS 

CR 

DESC 



L4 
AN 
TI 



IN 



PA 
PI 
AI 
PRAI 



DT 
LA 
OS 
CR 

DESC 

L4 
AN 
TI 



IN 
PA 
PI 
AI 
PRAI 

DT 
LA 
OS 
CR 

DESC 



WO 2000031252 Al 20000602 
WO 1999-EP9215 19991126 
EP 1998-122441 19981126 
Patent 
English 

2000-400065 [34] 
N-PSDB: AAD01234 
Binding domain of human 



53p 



semaphorin*** 



>6A* 



ANSWER 38 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AAY71460 Protein DGENE 

Nucleic acid coding for human ***semaphorin*** ***5 A *** _ 
used as diagnostic agent, therapeutic agent, for modulating immune 
system, in gene therapy or for effecting differentiation, cytoskeletal 
stabilization and/or plasticity - 
Behl c; Klostermann A 

(PLAC) MAX PLANCK GES FOERDERUNG WISSENSCHAFTEN . 

WO 2000031252 Al 20000602 53p 

WO 1999-EP9215 19991126 

EP 1998-122441 19981126 

Patent 

English 

2000-400065 [34] 
N-PSDB: AAD01233 

Human ***semaphorin*** ***5 A *** _ 

ANSWER 39 OF 86 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
ABA00055 CDNA DGENE 

New human cell adhesion proteins (cadhp) useful for treating, diagnosing 
and preventing diseases or conditions associated with the aberrant cadhp 
expression e.g. cancer, acquired immunodeficiency syndrome, Alzheimer's 
disease and epilepsy - 

Duggan B M ; Xu Y; Lee E A; Lee S; LuDAM; Warren B A; Yue H; Gietzen K 
3; Honchell c D; Burford n; Baughn M R; Tang T Y; Hi 11 man 3 L; Gandhi A 
R; Kallick D a; Bandman o; Graul R C; Walia N K; Lu Y; Ramkumar J; Yao M 
g; Lai p G 

(INCY-N) INCYTE GENOMICS INC. 

WO 2002059312 A2 20020801 149p 

WO 2001-US49206 20011218 

US 2000-256542P 20001218 

US 2000-259604P 20001222 

US 2001-260101P 20010105 

Patent 

English 

2002-590826 [63] 
P-PSDB: AAG79413 

CADHP-2 coding sequence, Incyte ID No 



7596315CB1. 
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ABA08449 CDNA DGENE 

Human proteins and DNA encoding sequences useful for preventing, treating 
or ameliorating a medical condition in a mammalian subiect e.q. arthritis 
and cancer - 

Tang Y T; Liu C; Drmanac R T 
(HYSE-N) HYSEQ INC. 
WO 2001057188 A2 20010809 
WO 2001-US3800 20010205 
US 2000-496914 20000203 
US 2000-560875 20000427 
Patent 
English 

2001-457740 [49] 
P-PSDB: ABB11205 
Human ***semaphori n*** 
NO:225. 



999p 



Via*** homologue-encoding cDNA, SEQ ID 
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AN AAH26247 CDNA DGENE 

TI Nucleic acid molecules encoding human secreted proteins, used in 
preventing, treating or ameliorating a disorder, e.g. Alzheimer's 
diseases and cancers - 

IN Ruben s M; shi Y 

PA (HUMA-N) HUMAN GENOME SCI INC. 

PI WO 2001053343 Al 20010726 339p 

AI wo 2001-US1436 20010117 

PRAI US 2000-176307 20000118 



DT Patent 

LA English 

OS 2001-483137 [52] 

CR P-PSDB: AAB82595 

DESC Human secreted protein cDNA clone HE8QT72. 
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AN AAD01234 DNA DGENE 

TI Nucleic acid coding for human ***semaphorin*** ***5 A *** _ 

used as diagnostic agent, therapeutic agent, for modulating immune 
system, in gene therapy or for effecting differentiation, cytoskeletal 
stabilization and/or plasticity - 

IN Behl C; Klostermann A 

PA (PLAC) MAX PLANCK GES FOERDERUNG WISSENSCHAFTEN. 

PI WO 2000031252 Al 20000602 53p 

AI WO 1999-EP9215 19991126 

PRAI EP 1998-122441 19981126 

DT Patent 

LA English 

OS 2000-400065 [34] 

CR P-PSDB: AAY71461 

DESC DNA encoding binding domain of human ***semaphorin*** ***5 A *** _ 
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AN AAD01233 DNA DGENE 

TI Nucleic acid coding for human ***semaphorin*** ***5 A *** _ ***]*** 
used as diagnostic agent, therapeutic agent, for modulating immune 
system, in gene therapy or for effecting differentiation, cytoskeletal 
stabilization and/or plasticity - 

IN Behl c; Klostermann A 

PA (PLAC) MAX PLANCK GES FOERDERUNG WISSENSCHAFTEN. 

PI WO 2000031252 Al 20000602 53p 

AI WO 1999-EP9215 19991126 

PRAI EP 1998-122441 19981126 

DT Patent 

LA English 

OS 2000-400065 [34] 

CR P-PSDB: AAY71460 

DESC Human ***semaphorin*** ***s A *** _ ***]*** cdna. 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank version (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



(ST): 



KEYWORDS 
SOURCE: 
ORGANISM (ORGN) 



CF132761 GenBank (R) 
CF132761 

CF132761.1 GI:40455103 

634698-57-0 

536 

mRNA; linear 
Expressed sequence tag 
31 Dec 2003 

4-4-A Chick ventral neuroepithelium cdna library Gallus 
gall us cDNA similar to ***semaphorin*** ***s A *** 
, mRNA sequence. 
EST 

Gal 1 us 
Gallus 



gallus (chicken) 
gal 1 us 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Archosauria; Aves ; Neognathae; 
Gall i formes; Phasianidae; Phasianinae; Gallus 

COMMENT: 

Contact: Soul a, c. 

Centre de Biologie du Developpement , UMR 5547 CNRS/UPS 

Universite Paul Sabatier, 118 route de Narbonne, 31062 Toulouse 

cedex, France 

Tel : 33 5 61 55 64 23 

Fax: 33 5 61 55 65 07 

Emai 1 : soul z@ci ct . f r 

POLYA=No. 

REFERENCE: 1 (bases 1 to 536) 

AUTHOR (AU): Braquart-Varnier,C. ; Clouscard-Martinato,C. ; Agius,E.; 

Escalas,N.; Danesin,C; Benazeraf ,B. ; Cochard,P.; 
Soul a, C. 

TITLE (TI) : Identification of new genes expressed in the gliogenic 

ventral neuroepithelium 
JOURNAL (so): unpublished (2003) 



FEATURES (FEAT) 
Feature Key 



Location 



Qua! ifier 



source 



1. .536 



/orqanism="Gallus gallus" 
/moT-type="mRNA" 
/db-xref="taxon : 9031" 
/tissue-type="ventral 
neuroepithelium" 
/dev-stage="Hamburger Hamilton 
stage 29-30" 

/clone-lib="chick ventral 
neuroepithelium cDNA library" 
/note= 'Vector: pT-Adv; Screening 
from a subtracted embryonic E6 
ventral neuroepithelial cDNA 
library Chick embryonic ventral 
neuroepithelial cDNA cloned into 
pT-Adv using T/A - based cloning 
system (AdvanTAge PCR cloning kit, 
Clontech)" 



SEQUENCE (SEQ): 
1 tgtatctgtg 
61 acccactccc 
121 gctcagtcca 
181 ttggctacac 
241 ctggctgccc 
301 cgtggcagta 
361 ccaccaatgg 
421 attcccagtg 
481 cagccaaaaa 



accatcgccg gagagatgtg 
gccgcggctc catgagcagt 
aagaaccaaa gcctgaagct 
caggcagcac taccaagatg 
tgccaacccc ggaatccacc 
gagaatggga aagaaaccag 
cctcacccgt gatcccaact 
tcatggttct gcccattgcc 
tgagcgacgg ggctcagatg 



gctgttgtgc agagaaagga 
gtaaccaagc tcagcggcct 
atcctcacac ctttgatgca 
ctaatcaagg ctgaccagca 
ccaaccctgc agcagaaaag 
aacctcatca atgcctgcac 
gacctgccac tcagggcttc 
cagcaaggct accaacacga 
tccgtggagg accagaatgc 



gaaggaactg 
ctttggagat 
caatggcaag 
ccatctggac 
gaagcccagc 
aaaagatatc 
tcctagccac 
gtatgttgac 
caccct 
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LOCUS (LOC) : 
GenBank ACC. NO. (gbn) 
GenBank VERSION (VER) 
CAS REGISTRY NO 
SEQUENCE LENGTH 
MOLECULE TYPE (CI) 
DIVISION CODE (CI) 
DATE (DATE): 
DEFINITION (DEF): 



(RN) 
(SQL) 



GenBank (R) 
GI: 38566245 



KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN) ; 



BC062979 
BC062979 
BC062979.1 
622741-81-5 
4139 

mRNA ; linear 
Rodents 
11 Dec 2003 

Mus musculus sema domain, transmembrane domain (TM) , 
and cytoplasmic domain, ( ***semaphorin*** ) 

*** 6A *** f mRNA (cDNA clone MGC ;86119 IMAGE: 6841689), 
complete cds. 
MGC 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

contact: MGC help desk 
Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. James Lin, University of Iowa 

cdna Library Preparation: M. Bento soares, University of Iowa 

cdna Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: University of Iowa, Dr. M. Bento soares and Dr. 

Thomas L. Casavant. 

Web site: http://genome.uiowa.edu 

Contact: bento-soares@uiowa.edu; tom-casavant@uiowa.edu 
Bonaldo,M.F. , Akabogu,l., Bair,T., Bair,J., crouch, K., Davis, A., 
Fishier, K., Keppel,c, Kucaba.T. , Lebeck,M., Melo,A., Schaefer,K., 
Scheetz,T., Smith, o, Snir,E., Tack,D., Trout, K., Walters, J., 
Casavant, T., Soares, M.B. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: Plate: Row: column: 0. 
REFERENCE: 1 (bases 1 to 4139) 

AUTHOR (AU): Strausberg,R.L. ; Feingold.E.A. ; Grouse, L.H.; 

Derge,J.G.; Klausner ,R.D. ; Collins, F.S.; Wagner, L.; 
Shenmen,C.M. ; schuler ,G.D. ; a! tschul ,s . F. ; 2eeberg,B.; 
Buetow,K.H.; schaefer , c. F. ; Bhat f N.K.; Hopkins ,R. f. ; 
Jordan, H.; Moore, T.; Max,s.l.; Wang, J.; Hsieh,F.; 



TITLE (TI): 

JOURNAL (SO) 

OTHER SOURCE 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 
Hong,L.; stapleton,M. ; Soares.M.B.; Bonal do ,M . F. ; 
Casavant.T.L. ; Scheetz ,T.E. ; Brownstei n ,M . J . ; 
Usdin.T.B.; Toshiyuki ,s. ; Carninci,P.; Prange,C; 
Raha,s.s.; Loquellano,N.A. ; Peters, G. J.; Abramson , R.D. ; 
Mullahy,s.J. ; Bosak,S.A.; McEwan,P.J.; McKernan, K. J . ; 
Malek,J.A.; Gunaratne,P.H. ; Richards, s.; Worley,K.C; 
Hale,s.; Garcia, A.M.; Gay.L.J.; Hulyk,s.w.; 
V-illalon,D.K. ; Muzny,D.M. ; Sodergren , E . J . ; Lu,x.; 
Gi bbs , R . A. ; Fahey,J.; Helton, E.; Ketteman,M.; Madan,A.; 
Rodngues,S.; Sanchez,A.; Whiting, M.; Madan.A.; 
Young.A.C; shevchenko , Y. ; Bouffard.G.G. ; 
Blakesley,R.w. ; Touchman, J .w. ; Green,E.D.; 
Dickson, m. C. ; Rodriguez, A. C. ; Grimwood,J.; Schmutz.J ■ 
Mvers,R.M.; Butterfield,Y.S. ; Krzywinski ,M.I. ; 
skalska,U.; Smailus ,D.E. ; Schnerch,A.; Schein,J.E.; 
Jones, S.J. ; Marra,M.A. 

Generation and initial analysis of more than 15,000 
full-length human and mouse cDNA sequences 
Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 
(2002) 
(OS): CA 138:84323 

2 (bases 1 to 4139) 
Strausberq,R. 
Direct submission 

Submitted (26-NOV-2003) National Institutes of Health, 
Mammalian Gene Collection (MGC) , Cancer Genomics 
office, National Cancer Institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT): 

Feature Key Location 

source 1..4139 



Qualifier 



gene 



CDS 



1..4139 



248. .3178 



/orqanism= M Mus musculus" 

/mol-type= ,I mRNA" 

/strain= ,, c57BL/6" 

/db-xref= M taxon : 10090" 

/cl one="MGC : 86119 IMAGE : 6841689 " 

/tissue-type="Brain n 

/clone-lib= n NlH-BMAP-GH0 H 

/lab-host="DHl0B n 

/note="Vector : pYX-ASC" 

/gene= n Sema6a" 

/note="synonyms: Via, sema, 

Sema6A-l, A730020P05Rik" 

/db-xref="LocusID: 20358" 

/db-xref="MGl : 1203727" 

/codon-start=l 

/product= n sema6a protein" 

/protein-id="AAH62979.1" 

/db-xref="GI : 38566246" 

/db-xref="LocusID: 20358" 

/translation="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFNPSCRNYRVDTLETFGDEFS 

GMARCPYDAKHANIALFADGKLYSATVTDFLAID 

AVI YRS LGDS PTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTSF 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

V LAT FST P YN S I PGS AVCA YDM L D 

IANVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTHPLMDEAVPSIINRPWFLRTMVRYRLTKIAVD 

NAAGPYQNHTWFLGSEKGIILKF 

LARIGSSGFLNGSLFLEEMNVYNPEKCSYDGVED 

KRIMGMQLDRASGS LYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWVRESGSC 

AHLSPLSRLTFEQDIERGNTDGLG 

DCHNSFVALNGVIRESYLKSNDQLVPVTLLAIAV 

ILAFVMGAVFSGIIVYCVCDHRRK 

DVAWQRKEKELTHSRRGSMSSVTKLSGLFGDTQ 

SKDPKPEAILTPLMHNGKLATPSN 



misc-feature 413. .1678 
misc-feature 1787. .1888 



TAKMLIKADQHHLDLTALPTPESTPTLQQKRKPN 
RGS R EWERNQN 1 1 NACTKDM P PMG 
SPVIPTDLPLRASPSHIPSVWLPITQQGYQHEY 
VDQ P KM S E WAQM A L E DQAAT LEY 
KTIKEHLSSKSPNHGVNLVENLDSLPPKVPQREA 
SLGPPGTSLSQTGLSKRLEMQHSS 
SYGLEYKRSYPTNSLTRSHQTTTLKRNNTNSSNS 
SHLSRNQSFGRGDNPPPAPQRVDS 
IQVHSSQPSGQAVTVSRQPSLNAYNSLTRSGLKR 
TPSLKPDVPPKPSFAPLSTSMKPN DACT" 
/note= n Sema; Region: semaphorin 
domain" 

/db-xref ="cdd : smart00630" 
/note="PSl; Region: domain found 
in Plexins, Semaphorins and 
Integrins" 

/db-xref= ,, CDD:smart00423" 



SEQUENCE (SEQ) : 

1 acacagctcg cttgcatccc agaaagtaat cgccgtgact gttgccccca aagagacaag 
61 cacacatgta ggaatgacaa aggcttgcga aggaaagagc gcagctcaag gcccagagag 
121 atcttctcga taatggatta ctaaatggga tactcgccgt gtcagtccgc tccgagccgc 
181 ggccgcctgc ccgtcgatgc accgaaaagg gtgaagtaga gaagcaagtc cccccgctga 
241 acctactatg cggccagcag ccttactgct gtgtctcaca ctgctacact gcgctggggc 
301 tggtttccca gaagattccg agccaatcag tatttcgcat ggcaactata caaaacagta 
361 tccggtgttt gtgggccaca agccaggacg gaacaccacg cagaggcaca ggctggacat 
421 ccagatgatc atgatcatga acagaaccct ctacgttgct gctcgagacc atatttatac 
481 tgttgatata gacacatccc acacagaaga aatttactgt agcaaaaaac tgacatggaa 
541 atctagacag gctgacgtag acacatgcag gatgaagggg aaacataagg atgaatgtca 
601 caacttcatt aaagttcttc tcaagaagaa tgatgatacg ctgtttgtct gtggaaccaa 
661 tgccttcaac ccttcctgca gaaactacag ggtcgatacc ttggaaactt ttggggatga 
721 atttagcgga atggccagat gcccttatga tgccaaacat gccaacatcg ctctgtttgc 
781 agatggaaaa ctctactcgg ctacagtgac tgactttctg gccattgatg cagtcattta 
841 caggagcctc ggagacagcc ctaccctcag gactgtcaag catgattcaa agtggttgaa 
901 agagccgtac tttgtccaag ccgtggatta tggggactat atctacttct tcttcagaga 
961 aattgcagta gaatacaaca ctatggggaa ggttgttttc cctagggtgg ctcaggtctg 
1021 taagaatgac atgggagggt ctcagagagt cctggagaag cagtggacat ctttcctgaa 
1081 ggctcgcctg aactgctcgg tgcctggaga ctctcatttt tatttcaata tactccaggc 
1141 agttacagat gtgattcgca ttaatggccg tgatgttgtc ttggcaacct tttccacacc 
1201 ttataacagc atcccaggtt ctgcagtctg tgcctatgac atgcttgaca ttgctaatgt 
1261 tttcactggg aggttcaagg aacagaaatc acctgactct acctggacac ccgttccaga 
1321 cgaacgagtc cctaagccca ggccaggctg ttgtgctgga tcatcctctt tagaaaaata 
1381 tgcaacctcc aatgagtttc ccgatgatac cctgaacttc attaagacgc atccactcat 
1441 ggacgaggca gtgccttcca tcatcaacag accttggttc ctgagaacaa tggtcagata 
1501 ccgcctgacc aaaattgcag tagacaacgc tgccgggcca tatcagaatc acactgtggt 
1561 tttcctggga tcagaaaagg gaatcatcct gaagttcttg gccaggatag gaagcagtgg 
1621 tttcctaaat ggcagccttt tcctggagga gatgaatgtt tacaacccag aaaagtgcag 
1681 ctatgatggt gtagaagaca aaaggatcat gggcatgcag ctcgacagag cgagtggctc 
1741 actctatgtt gcattctcta cttgtgtgat caaggtgcct cttggccgct gtgagcgaca 
1801 tgggaagtgt aaaaaaacct gcatcgcctc cagagacccg tattgtgggt gggtaaggga 
1861 aagtggttcc tgtgcccatc tgtcacccct tagcagactg acatttgagc aggacattga 
1921 gcgtggcaat acggacggcc taggagactg tcacaattcc ttcgtggcac tgaatggagt 
1981 gattcgggaa agttacctca aaagcaacga ccagcttgtt cctgtcaccc tcctggccat 
2041 tgcagtcatt ctggcttttg tcatgggggc cgtcttctcg ggcatcatcg tgtattgtgt 
2101 gtgcgatcac cggcgcaaag acgtggcagt agtgcagcgc aaggagaaag agctcactca 
2161 ctcgcgtcgg ggatctatga gcagtgtcac caagctcagt ggcctctttg gggacaccca 
2221 gtccaaggac ccaaagcctg aggccatcct cacaccactc atgcacaacg gcaagctggc 
2281 cacgcctagc aacaccgcca agatgctcat caaggctgac cagcatcacc tagacctcac 
2341 cgccctgccc accccagagt ccaccccgac actgcagcag aaacggaaac ccaaccgcgg 
2401 cagtcgcgag tgggagagga accagaacat catcaatgcc tgcaccaagg acatgcctcc 
2461 catgggttcc cctgtgattc ccacggacct gcccctccgg gcctccccaa gccacatccc 
2 521 cagcgtggtg gtcctgccca tcacgcagca gggctaccag cacgagtacg tagatcagcc 
2581 caaaatgagc gaggtggtgg ctcagatggc actggaggac caggctgcca ccctggagta 
2641 taagaccatc aaagagcacc tgagtagcaa gagtcccaac catggggtga accttgtgga 
2701 gaacctggac agcctgcccc ctaaagttcc acagcgcgag gcctccctag gtcccccggg 
2761 aacctcactg tcacaaaccg gcctgagcaa gaggctggag atgcaacact cctcctccta 
2821 tgggctcgaa tataagagga gctaccccac gaactcgctc acaagaagcc accagaccac 
2881 cactctcaaa agaaacaata ctaactcctc caattcctcc cacctctcca ggaaccagag 
2941 ctttggccgg ggagacaacc caccccccgc cccgcagcgg gtggactcta tccaggtgca 
3001 cagctcccag ccctctggcc aggccgtgac tgtttcgagg cagcccagcc tcaatgccta 
3061 caactcactg acgaggtcgg ggctgaagcg caccccctcg ctaaagccag atgtaccccc 
3121 caaaccttcc tttgctcccc tttccacatc catgaagccc aatgatgcat gtacataatc 
3181 ccaggggttg gggggccagg agttgaagca tccgatgagg cgaggcaccc acccagctga 
3241 gcaaggtctc cattgcctcg agtatccacc cgaccaagat ggccctggag gagctgagga 
3301 cgctgggtcc tcctccctgg gacataggga tactcttcaa aattgggccg tgtggtctgg 
3361 tgaaggttta gcaatccacc tctgcgttct cttccttcac atccacgtca tacagcaggt 



3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 



cggactcacg 
atcctcacat 
acacacacac 
catcacacca 
aggacgttac 
attttttttt 
attctaactc 
caccagtgtt 
tttctgctcc 
attgtccacc 
cagctctgcg 
aattgagttt 



aaagacttca 
gaacacaggc 
acacacacca 
caccacacac 
caaacttggc 
ttaaaaaaaa 
agactttaac 
acagctttgt 
tcttcgcttt 
ccttttcttt 
gaagcccgtg 
ctttattttt 



gtatcatcac 
cacagtcaca 
taccacacca 
gcaatggtaa 
taagcatcag 
tcatgaaaat 
aatggcagaa 
tatagcaggt 
taatgaataa 
ttcctcaagg 
gaaagttaaa 
acaataaatt 



aaacatgagc 
ttcgtgcact 
caccacacca 
acgacaacca 
aaactttaaa 
taaaaaaaaa 
gtttactatg 
aaatgccatg 
cgtgaccgtt 
aaaagactct 
ctatttaacg 
cactgagtaa 



caaaagcaca tctccacccc 
acatacacat acacacacac 



caccacacca 
cattttgtcc 
acacacacac 
taataaagga 
cgcagatact 
ttgggcaacg 
aacgcaagta 
tccaaatgcc 
tgaaatccat 
ataaaaaaaa 



caccacacca 
ctgactatac 
acacacaacc 
ttcattgata 
gtgaaatgcc 
atgtcataga 
actcttattt 
accacatgaa 
taactggaat 
aaaaaaaaa 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF) : 



KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN): 



GENBANK. RTM. COPYRIGHT 2004 on STN 

BC059238 GenBank (R) 
BC059238 

BC059238.1 Gl: 37748386 

608850-56-2 

4702 

mRNA; linear 

Rodents 

20 Oct 2003 

Mus musculus sema domain, transmembrane domain (TM) , 
and cytoplasmic domain, ( ***semaphorin*** ) 

***6A*** , mRNA (cDNA clone MGC:66957 IMAGE: 6417475) , 
complete cds. 
MGC „ 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Eutel eostomi ; Mammal i a ; Euthe ri a ; Rodent i a ; 
Sciurognathi ; Muridae; Murinae; Mus 
1255 a 1315 c 1076 g 1056 t 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 

cDNA Library Preparation: M. Bento Soares University of Iowa 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanfjord, CA 94305 

Web site: http://www-shgc.stanford.edu c 

Contact: (Dickson, Mark) mcd@paxil.stanforcl.edu 

Dickson, M. , Schmutz, 3 . , Grimwood, 3 . , Rodriquez, A., and Myers, 

R. M. 

clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL- at :Jittp: //image. 11 nl .gov 
Series: IRAK Plate: 125 Row: o Column: 20 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 9055333. 



REFERENCE: 

AUTHOR (AU) : 



TITLE (TI) 



1 (bases 1 to 4702) 
strausberg,R.L. ; Feingold , E.A. ; Grouse, L.H. ; 
Derge,3.G.; Klausner ,R.D. ; Collins,F.S.; Wagner, L. ; 
shenmen,C.M. ; schuler ,G.D. ; Altschul ,s. F. ; Zeeberg,B.; 
Buetow,K.H.; Schaefer ,C.F. ; Bhat,N.K.; Hopkins , R. F. ; 
Jordan, H.; Moore, T.; Max,s.I.; Wang, J.; Hsieh,F.; 
Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin.G.M.; 
Hong,L.; stapleton ,M. ; Scares, M.B.; Bona! do ,M . F. ; 
Casavant ,T.L. ; Scheetz,T.E. ; Brownstei n , M . J . ; 
Usdin,T.B.; Toshiyuki ,s . ; carninci,P.; Prange,C.; 
Raha,s.s.; Loquellano,N.A. ; Peters, G. J.; Abramson,R.D. 
Mullahy ,S.J . ; Bosak,S.A.; McEwan, P.J . ; McKernan , K . J . ; 
Malek,3.A.; Gunaratne,P.H. ; Richards,s.; Worley,K.C; 
Hale,s.; Garcia, A.M.; Gay,L.J.; Hulyk,s.w.; 
villalon,D.K. ; Muzny , D.M . ; sodergren , E . 3 . ; Lu,X.; 
Gibbs,R.A.; Fahey,J.; Helton,E.; Ketteman,M.; Madan,A. 
Rodrigues,s. ; Sanchez, A.; Whiting, M.; Madan,A.; 
Young, A. c. ; shevchenko, Y. ; Bouffard,G.G. ; 
Blakesley,R.W. ; Touchman, 3 .W. ; Green, E.D.; 
Dickson, M. c. ; Rodriguez, A. C. ; Grimwood, 3.; schmutz, 3.; 
Myers, R.M.; Butterfield,Y.S. ; Krzywinski ,M.I. ; 
Skalska,u.; Smailus.D.E. ; Schnercn,A.; schein,3.E.; 
Jones, s. 3.; Marra,M.A. 

Generation and initial analysis of more than 15,000 
full-length human and mouse cDNA sequences 



JOURNAL (SO): 

OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI) : 
JOURNAL (SO): 



FEATURES (FEAT): 
Feature Key 



Acad. Sci. U.S.A., 99 (26), 16899-16903 



Proc. Natl 
(2002) 
CA 138:84323 
2 (bases 1 to 4702) 
Strausberg,R. 
Direct Submission 

Submitted (01-OCT-2003) National Institutes of Health, 
Mammalian Gene Collection (MGC) , Cancer Genomics 
Office, National cancer institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



Location 



Qualifier 



source 



1. .4702 



gene 



CDS 



1. .4702 



632. .3727 



misc-feature 797. .2062 



misc-feature 2171. .2272 



/organism="Mus musculus" 

/moT-type="mRNA" 

/strain="C57BL/6" 

/db-xref="taxon : 10090 n 

/cl one="MGC : 66957 IMAGE : 6417475 " 

/tissue-type="Brain , enriched 

mouse brain 12.5dpc" 

/cl one-1 i b="NlH-BMAP-Fl0" 

/lab-host="DHl0B" 

/note= M vector: pYX-ASC" 

/gene= M sema6a n 

/note= M synonyms : via, sema, 

Sema6A-l, A730020P05Rik ,f 

/db-xref="LocuslD: 20358" 

/db-xref="MGI: 1203727" 

/codon-start=l 

/product="Sema6a protein" 

/protein-id="AAH59238.1" 

/db-xref="GI: 37748387" 

/db-xref="LocuslD: 20358" 

/t ransl ati on="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFNPSCRNYRVDTLETFGDEFS 

GM ARCP YDAKH AN I A L FADGK LYSATVTD F LAI D 

AVI YRS LGDS PTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTSF 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

V LAT FST P YN S I PGS AVCAYDM LD 

IANVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTH P LMDEAV PS 1 1 N R PWF L RTMVR YR LTK I AVD 

NAAGPYQNHTWFLGSEKGIILKF 

LARIGSSGFLNGSLFLEEMNVYNPEKCSYDGVED 

KRIMGMQLDRASGSLYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWVRESGSC 

AHLSPLSRLTFEQDIERGNTDGLG 

DCHNSFVALNGHASSLYPSTTTSDSASRDGYESR 

GGMLDWNDLLEAPGSTDPLGAVSS 

HNHQDKKGVIRESYLKSNDQLVPVTLLAIAVILA 

FVMGAVFSGIIVYCVCDHRRKDVA 

WQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKD 

PKPEAILTPLMHNGKLATPSNTAK 

MLIKADQHHLDLTALPTPESTPTLQQKRKPNRGS 

REWERNQNIINACTKDMPPMGSPV 

I PTD L P L RAS PS H I PS VW L PITQQGYQH E YVDQ 

PKMSEWAQMALEDQAATLEYKTI 

KEHLSSKSPNHGVNLVENLDSLPPKVPQREASLG 

PPGTSLSQTGLSKRLEMQHSSSYG 

LEYKRSYPTNSLTRSHQTTTLKRNNTNSSNSSHL 

S RNQS FGRGDNPPPAPQRVDS IQV 

HSSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPS 

LKPDVPPKPSFAPLSTSMKPNDAC T" 

/note="Sema; Region: semaphorin 

domain" 

/db-xref="CDD:smart00630" 
/note="PSl; Region: domain found 
in Plexins, Semaphorins and 
Integrins" 



SEQUENCE (SEQ) : 

1 ttttccgcgg tgccgattgc ccgagatgcc 
61 cgccacaccc gggacactgc actcggggtt 
121 ctcaaccccc cacaccgccc cccgccctga 
181 gaggggactc agaagtgaat ccaagtggaa 
241 tttaccctgg tcctcgctgg ttttcttctt 
301 tttttcagtt attttatttt actttttatt 
361 ctcctcttct gtcattggag atgaacacag 
421 gactgttgcc cccaaagaga caagcacaca 
481 gagcgcagct caaggcccag agagatcttc 
541 ccgtgtcagt ccgctccgag ccgcggccgc 
601 tagagaagca agtccccccg ctgaacctac 
661 cacactgcta cactgcgctg gggctggttt 
721 gcatggcaac tatacaaaac agtatccggt 
781 cacgcagagg cacaggctgg acatccagat 
841 tgctgctcga gaccatattt atactgttga 
901 ctgtagcaaa aaactgacat ggaaatctag 
961 ggggaaacat aaggatgaat gtcacaactt 
1021 tacgctgttt gtctgtggaa ccaatgcctt 
1081 taccttggaa acttttgggg atgaatttag 
1141 acatgccaac atcgctctgt ttgcagatgg 
1201 tctggccatt gatgcagtca tttacaggag 
1261 caagcatgat tcaaagtggt tgaaagagcc 
1321 ctatatctac ttcttcttca gagaaattgc 
1381 tttccctagg gtggctcagg tctgtaagaa 
1441 gaagcagtgg acatctttcc tgaaggctcg 
1501 tttttatttc aatatactcc aggcagttac 
1561 tgtcttggca accttttcca caccttataa 
1621 tgacatgctt gacattgcta atgttttcac 
1681 ctctacctgg acacccgttc cagacgaacg 
1741 tggatcatcc tctttagaaa aatatgcaac 
1801 cttcattaag acgcatccac tcatggacga 
1861 gttcctgaga acaatggtca gataccgcct 
1921 gccatatcag aatcacactg tggttttcct 
1981 cttggccagg ataggaagca gtggtttcct 
2041 tgtttacaac ccagaaaagt gcagctatga 
2101 gcagctcgac agagcgagtg gctcactcta 
2161 gcctcttggc cgctgtgagc gacatgggaa 
2221 cccgtattgt gggtgggtaa gggaaagtgg 
2281 actgacattt gagcaggaca ttgagcgtgg 
2341 ttccttcgtg gcactgaatg ggcacgccag 
2401 ttcggcatcc cgagacgggt atgagtctag 
2461 cgaggcacct ggcagcacag accctttggg 
2521 gaagggagtg attcgggaaa gttacctcaa 
2581 cctggccatt gcagtcattc tggcttttgt 
2641 gtattgtgtg tgcgatcacc ggcgcaaaga 
2701 gctcactcac tcgcgtcggg gatctatgag 
2761 ggacacccag tccaaggacc caaagcctga 
2821 caagctggcc acgcctagca acaccgccaa 
2881 agacctcacc gccctgccca ccccagagtc 
2941 caaccgcggc agtcgcgagt gggagaggaa 
3001 catgcctccc atgggttccc ctgtgattcc 
3061 ccacatcccc agcgtggtgg tcctgcccat 
3121 agatcagccc aaaatgagcg aggtggtggc 
3181 cctggagtat aagaccatca aagagcacct 
3241 ccttgtggag aacctggaca gcctgccccc 
3301 tcccccggga acctcactgt cacagaccgg 
3361 ctcctcctat gggctcgaat ataagaggag 
3421 ccagaccacc actctcaaaa gaaacaatac 
3481 gaaccagagc tttggccggg gagacaaccc 
3541 ccaggtgcac agctcccagc cctctggcca 
3601 caatgcctac aactcactga cgaggtcggg 
3661 tgtacccccc aaaccttcct ttgctcccct 
3721 tacataatcc caggggttgg ggggccagga 
3781 cccagctgag caaggtctcc attgcctcga 
3841 agctgaggac gctgggtcct cctccctggg 
3901 gtggtctggt gaaggtttag caatccacct 
3961 acagcaggtc ggactcacga aagacttcag 
4021 ctccacccca tcctcacatg aacacaggcc 
4081 cacacacaca cacacacaca cacacaccat 
4141 accacaccac atcacaccac accacacacg 
4201 tgactataca ggacgttacc aaacttggct 
4261 cacacaacca ttttttttta aaaaaaatca 
4321 attgataatt ctaactcaga ctttaacaat 



/db-xref="CDD:smart00423" 



cgcctaggag cgtgcggccg cggccagcgt 
gcagagccgg ccagagccct gcacctctgc 
aatgactagt taatgggcgc tggcaccgcc 
tttggatttg gaaaaagagt ttcttgagca 
ttttctgttt tttccctttt ttctttttct 
tttatttttt atttttggtc ttcctctcac 
ctcgcttgca tcccagaaag taatcgccgt 
tgtaggaatg acaaaggctt gcgaaggaaa 
tcgataatgg attactaaat gggatactcg 
ctgcccgtcg atgcaccgaa aagggtgaag 
tatgcggcca gcagccttac tgctgtgtct 
cccagaagat tccgagccaa tcagtatttc 
gtttgtgggc cacaagccag gacggaacac 
gatcatgatc atgaacagaa ccctctacgt 
tatagacaca tcccacacag aagaaattta 
acaggctgac gtagacacat gcaggatgaa 
cattaaagtt cttctcaaga agaatgatga 
caacccttcc tgcagaaact acagggtcga 
cggaatggcc agatgccctt atgatgccaa 
aaaactctac tcggctacag tgactgactt 
cctcggagac agccctaccc tcaggactgt 
gtactttgtc caagccgtgg attatgggga 
agtagaatac aacactatgg ggaaggttgt 
tgacatggga gggtctcaga gagtcctgga 
cctgaactgc tcggtgcctg gagactctca 
agatgtgatt cgcattaatg gccgtgatgt 
cagcatccca ggttctgcag tctgtgccta 
tgggaggttc aaggaacaga aatcacctga 
agtccctaag cccaggccag gctgttgtgc 
ctccaatgag tttcccgatg ataccctgaa 
ggcagtgcct tccatcatca acagaccttg 
gaccaaaatt gcagtagaca acgctgccgg 
gggatcagaa aagggaatca tcctgaagtt 
aaatggcagc cttttcctgg aggagatgaa 
tggtgtagaa gacaaaagga tcatgggcat 
tgttgcattc tctacttgtg tgatcaaggt 
gtgtaaaaaa acctgcatcg cctccagaga 
ttcctgtgcc catctgtcac cccttagcag 
caatacggac ggcctaggag actgtcacaa 
ttccctctat cccagcacca ctacgtcaga 
gggaggcatg ctggactgga acgacctgct 
ggcagtgtcc tctcataacc accaggacaa 
aagcaacgac cagcttgttc ctgtcaccct 
catgggggcc gtcttctcgg gcatcatcgt 
cgtggcagta gtgcagcgca aggagaaaga 
cagtgtcacc aagctcagtg gcctctttgg 
ggccatcctc acaccactca tgcacaacgg 
gatgctcatc aaggctgacc agcatcacct 
caccccgaca ctgcagcaga aacggaaacc 
ccagaacatc atcaatgcct gcaccaagga 
cacggacctg cccctccggg cctccccaag 
cacgcagcag ggctaccagc acgagtacgt 
tcagatggca ctggaggacc aggctgccac 
gagtagcaag agtcccaacc atggggtgaa 
taaagttcca cagcgcgagg cctccctagg 
cctgagcaag aggctggaga tgcaacactc 
ctaccccacg aactcgctca caagaagcca 
taactcctcc aattcctccc acctctccag 
accccccgcc ccgcagcggg tggactctat 
ggccgtgact gtttcgaggc agcccagcct 
gctgaagcgc accccctcgc taaagccaga 
ttccacatcc atgaagccca atgatgcatg 
gttgaagcat ccgatgaggc gaggcaccca 
gtatccaccc gaccaagatg gccctggagg 
acatagggat actcttcaaa attgggccgt 
ctgcgttctc ttccttcaca tccacgtcat 
tatcatcaca aacatgagcc aaaagcacat 
acagtcacat tcgtgcacta catacacata 
accacaccac accacaccac accacaccac 
caatggtaaa cgacaaccac attttgtccc 
aagcatcaga aactttaaaa cacacacaca 
tgaaaattaa aaaaaaataa taaaggattc 
ggcagaagtt tactatgcgc agatactgtg 



4381 aaatgcccac 
4441 tcatagattt 
4501 cttatttatt 
4561 acatgaacag 
4621 ctggaataat 
4681 ggaaaaaaaa 



cagtgttaca 
ctgctcctct 
gtccacccct 
ctctgcggaa 
tgagtttctt 
aaaaaaaaaa 



gctttgttat 
tcgcttttaa 
tttctttttc 
gcccgtggaa 
tatttttaca 
aa 



agcaggtaaa 
tgaataacgt 
ctcaaggaaa 
agttaaacta 
ataaattcac 



tgccatgttg 
gaccgttaac 
agactcttcc 
tttaacgtga 
tgagtaaata 



ggcaacgatg 
gcaagtaact 
aaatgccacc 
aatccattaa 
atttggagct 
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LOCUS (LOC) : 

GenBank ACC. NO. (GBN) : 

GenBank version (ver) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



KEYWORDS (ST): 
SOURCE : 
ORGANISM (ORGN): 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AK082711 GenBank (R) 
AK082711 

AK082711 .1 GI : 26349884 

485744-81-8 

3329 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 0 day neonate cerebellum cDNA, RIKEN 
full-length enriched library, clone:c230094Al9 
product :sema domain, transmembrane domain (TM) , and 
cytoplasmic domain, ( *** semaphoring** ) ***sa*** 
, full insert sequence. 
htc ; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 



Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
URL : http : //f antom . gsc . r i ken . j p/ . 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 
OTHER SOURCE (OS) : 

REFERENCE: 

AUTHOR (AU) : 



TITLE (TI); 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI); 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



Carninci ,P. ; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci, P.; Shibata.Y.; Hayatsu,N.; Sugahara,Y. ; 
Shibata,K.; ltoh,M.; Konno,H.; 0kazaki,Y.; 
Muramatsu.M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka,S.; Sasaki, N. ; 
Carninci, P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai,T.; Tashiro.H.; ltoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,S.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto , R . ; Matsumoto , H . ; Sakaguchi , s . ; Ikegami , T . ; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K.; Togawa,Y.; 
Izawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
lshikawa,T.; Ozawa,K.; Tanaka.T. ; Matsuura.s.; 
Kawai,J.; okazaki,Y.; Muramatsu.M. ; lnoue,Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO) 



Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3329) 

Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa.T. ; Bono,H.; 
Carninci,P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume,w. ; Hayashida,K. ; Hayatsu.N.; 
Hiramoto,K. ; Hiraoka,T.; Hirozane,T.; Hori,F.; 
Imotani.K.; lshii,Y.; itoh.M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,j.; Kojima,Y. ; Kondo,S.; Konno,H.; 
Kouda,M.; Koya.s.; Kun hara,C. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K.; Numazaki.R.; Ohno,M.; ohsato,N.; okazaki.Y.; 
Saito,R.; Saitoh, H.; Sakai,c.; sakai,K.; sakazume,N.; 
Sano,H.; Sasaki, D. ; shibata,K.; shinagawa,A. 
Shiraki.T.; Sogabe,Y.; Tagami,M. 
Takahashi , F . ; Takaku-Akahi ra , S . ; 
TomarUjA.; Toya,T. ; Yasunishi,A. 
Hayashizaki ,Y. 
Direct Submission 

Submitted (16-APR-2002) Yoshihide Hayashizaki, The 
Institute of Physical and Chemical Research (RIKEN) , 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc . r i ken . j p , URL : http : //genome . gsc . ri ken . j p/ , 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



Tagawa,A. 
Takeda , Y . ; Tanaka , T . 
; Muramatsu.M. ; 



FEATURES (FEAT) : 
Feature Key 



Location 



Qualifier 



source 1..3329 /organism="Mus musculus' 1 

/moT-type="mRNA" 
/strain="c57BL/6J n 
/db-xref =" FANTOM-DB : C2 30094A19" 
/db-xref="taxon : 10090" 
/clone="C230094Al9" 
/ti ssue-type= M ce rebel 1 urn" 
/clone-lib="RlKEN full-length 
enriched mouse cDNA library" 
/dev-stage="0 day neonate" 

CDS 673.-2436 /note="unnamed protein product; 

putative sema domain, 

transmembrane domain (TM) , and 

cytoplasmic domain, (semaphorin) 

6A (MGD | MGI: 1203727, GB | NM-018744, 

evidence: BLASTN, 100%, 

match=1802)" 

/codon-start=l 

/protei n-i d="BAC38582 . 1" 

/db-xref="GI: 26349885" 

/transl ation="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFNPSCRNYRVDTLETFGDEFS 

GMA RC P YDAKH AN I AL FADGK L YS ATVTD F L AI D 

AVI YRS LGDS PTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTS F 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

VLATFSTPYNSIPGSAVCAYDMLD 

IANVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTHPLMDEAVPSIINRPWFLRTMVRYRLTKIAVD 

NAAGPYQNHTWFLGSEKGIILKF 

LARIGSSGFLNGSLFLEEMNVYNPEKCSYDGVED 

KRIMGMQLDRASGSLYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWVRESGSC 

AHLSPLSRLTFEQDIERGNTDGLG 

DCHNSFVALNGKDNVTVTISR" 



SEQUENCE (SEQ) : 



1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 



ggcactcggc 
cccgagatgc 
cactcggggt 
ccccgccctg 
tccaagtgga 
gttttcttct 
tactttttat 
gatgaacaca 
acaagcacac 
gagagatctt 
gccgcggccg 
gctgaaccta 

ggggctggtt 

cagtatccgg 
gacatccaga 
tatactgttg 
tggaaatcta 
tgtcacaact 
accaatgcct 
gatgaattta 
tttgcagatg 
atttacagga 
ttgaaagagc 
agagaaattg 
gtctgtaaga 
ctgaaggctc 
caggcagtta 
acaccttata 
aatgttttca 
ccagacgaac 
aaatatgcaa 
ctcatggacg 
agataccgcc 
gtggttttcc 
agtggtttcc 
tgcagctatg 
ggctcactct 
cgacatggga 
agggaaagtg 
attgagcgtg 
ggtaaggaca 
ctgaatagtt 
tttcaactcc 
ctatatatat 
tcttaatcat 
agattttatt 
gtcttacata 
tatctaatga 
atttaggata 
ataccaaccc 
acagcacaga 
aaaacattca 
agaatgtatg 
tgttctccac 
cacgtgcaag 
attaatttaa 



tgcctaggcg 
ccgcctagga 
tgcagagccg 
aaatgactag 
atttggattt 
tttttctgtt 
ttttattttt 
gctcgcttgc 
atgtaggaat 
ctcgataatg 
cctgcccgtc 
ctatgcggcc 
tcccagaaga 
tgtttgtggg 
tgatcatgat 
atatagacac 
gacaggctga 
tcattaaagt 
tcaacccttc 
gcggaatggc 
gaaaactcta 
gcctcggaga 
cgtactttgt 
cagtagaata 
atgacatggg 
gcctgaactg 
cagatgtgat 
acagcatccc 
ctgggaggtt 
gagtccctaa 
cctccaatga 
aggcagtgcc 
tgaccaaaat 
tgggatcaga 
taaatggcag 
atggtgtaga 
atgttgcatt 
agtgtaaaaa 
gttcctgtgc 
gcaatacgga 
atgttactgt 
taaagtctta 
tctaccagat 
atatattaat 
agaacagatt 
atacttaaaa 
tttaatccat 
attttagaag 
aaagaacagt 
cctaatccgg 
cgccggggtt 
agttcagaaa 
caaggaagca 
ctgaaataca 
ccagaatgca 
aggattgaat 



ctc gggacgc 

gcgtgcggcc 
gccagagccc 
ttaatgggcg 
ggaaaaagag 
ttttcccttt 
tatttttggt 
atcccagaaa 
gacaaaggct 
gattactaaa 
gatgcaccga 
agcagcctta 
ttccgagcca 
ccacaagcca 
catgaacaga 
atcccacaca 
cgtagacaca 
tcttctcaag 
ctgcagaaac 
cagatgccct 
ctcggctaca 
cagccctacc 
ccaagccgtg 
caacactatg 
agggtctcag 
ctcggtgcct 
tcgcattaat 
aggttctgca 
caaggaacag 
gcccaggcca 
gtttcccgat 
ttccatcatc 
tgcagtagac 
aaagggaatc 
ccttttcctg 
agacaaaagg 
ctctacttgt 
aacctgcatc 
ccatctgtca 
cggcctagga 
cacaatttca 
ggtttctttt 
tatgaaatgt 
acttaaagtt 
ctttgttact 
aaaatacatt 
atactatgaa 
gagaagaaat 
gcttataaca 
cctgtcagga 
ttgtgaggca 
gaatttgaat 
tgaggcagtg 
caagcacaga 
gctgttttgg 
gtatttttt 



agcacaggcg 
gcggccagcg 
tgcacctctg 
ctggcaccgc 
tttcttgagc 
tttctttttc 
cttcctctca 
gtaatcgccg 
tgcgaaggaa 
tgggatactc 
aaagggtgaa 
ctgctgtgtc 
atcagtattt 
ggacggaaca 
accctctacg 
gaagaaattt 
tgcaggatga 
aagaatgatg 
tacagggtcg 
tatgatgcca 
gtgactgact 
ctcaggactg 
gattatgggg 
gggaaggttg 
agagtcctgg 
ggagactctc 
ggccgtgatg 
gtctgtgcct 
aaatcacctg 
ggctgttgtg 
gataccctga 
aacagacctt 
aacgctgccg 
atcctgaagt 
gaggagatga 
atcatgggca 
gtgatcaagg 
gcctccagag 
ccccttagca 
gactgtcaca 
cgttgatgga 
taattaattt 
ctttcaacag 
actcatcctt 
tcagggcccc 
ctgcttacat 
acatgccaaa 
ttcttcttga 
gataaactgt 
caacctctcc 
agcatgtgct 
atttaaacat 
agtgtaagat 
tccaagttca 
ttttggtttt 



cttttccgcg 
tcgccacacc 
cctcaacccc 
caaggggact 
atttaccctg 
ttttttcagt 
cctcctcttc 
tgactgttgc 
agagcgcagc 
gccgtgtcag 
gtagagaagc 
tcacactgct 
cgcatggcaa 
ccacgcagag 
ttgctgctcg 
actgtagcaa 
aggggaaaca 
atacgctgtt 
ataccttgga 
aacatgccaa 
ttctggccat 
tcaagcatga 
actatatcta 
ttttccctag 
agaagcagtg 
atttttattt 
ttgtcttggc 
atgacatgct 
actctacctg 
ctggatcatc 
acttcattaa 
ggttcctgag 
ggccatatca 
tcttggccag 
atgtttacaa 
tgcagctcga 
tgcctcttgg 
acccgtattg 
gactgacatt 
attccttcgt 
aatctgttgt 
ttatcatttt 
agtgtatggt 
tggtagtttt 
aattcactaa 
gagacagtta 
gttcaagtat 
atatgcctgt 
agcccctcgg 
aacagggaaa 
gtgtagcatg 
atttccagta 
ccaaacagga 
gggggttatc 
cctagcactt 



gtgccgattg 
cgggacactg 
ccacaccgcc 
cagaagtgaa 
gtcctcgctg 
tattttattt 
tgtcattgga 
ccccaaagag 
tcaaggccca 
tccgctccga 
aagtcccccc 
acactgcgct 
ctatacaaaa 
gcacaggctg 
agaccatatt 
aaaactgaca 
taaggatgaa 
tgtctgtgga 
aacttttggg 
catcgctctg 
tgatgcagtc 
ttcaaagtgg 
cttcttcttc 

ggtggctcag 

gacatctttc 
caatatactc 
aaccttttcc 
tgacattgct 
gacacccgtt 
ctctttagaa 
gacgcatcca 
aacaatggtc 
gaatcacact 
gataggaagc 
cccagaaaag 
cagagcgagt 
ccgctgtgag 
tgggtgggta 
tgagcaggac 
ggcactgaat 
actaatccac 
tttatagaca 
cagtttattg 
tagttttcat 
acttaatttt 
aggagaccaa 
tgggtaaatc 
atatagagag 
ctgataggga 
ttcccatata 
aaaatggcca 
agacttcttc 
ttttgcccat 
cttacttcct 
ccaaacacct 
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LOCUS (LOC): 

GenBank acc. NO. (gbn) 

GenBank version (ver) 

CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



KEYWORDS (ST) : 
SOURCE : 
ORGANISM (ORGN) 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AK052232 GenBank (R) 
AK052232 

AK052232.1 GI: 26342457 

492936-40-0 

4476 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 13 days embryo heart cDNA, RIKEN 
full-length enriched library, clone :D330011G23 
product: weakly similar to ***semaphorin*** 

*** 6A *** PRECURSOR ( ***SEMAPHORIN*** *** V IA*** 
) (SEMA VIA) (SEMAPHORIN Q) (SEMA Q) [Mus musculus], 
full insert sequence. 
HTC; cap trapper 
Mus musculus (house mouse) 
Mus musculus 



Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

CDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues, 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . jp/ 
URL : http : / / f antom . gsc . r i ken . j P/ • 



(OS) 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 
OTHER SOURCE 

REFERENCE: 

AUTHOR (AU) : 



TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI) : 



JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 



Carninci,P.; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol . , 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; shibata,Y.; Hayatsu,N.; Sugahara,Y.; 
shibata,K.; ltoh,M.; Konno,H.; okazaki ,Y. ; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka,s.; Sasaki, N.; 
Carninci.P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai,T.; Tashiro.H.; ltoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine.T.; Harada.A.; 
Yamamoto,R.; Matsumoto,H. ; Sakaguchi ,s. ; lkegami,T.; 
Kashiwagi ,K. ; Fujiwake,S.; Inoue,K.; Togawa,Y. ; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
Ishikawa,T.; Ozawa.K. ; Tanaka,T.; Matsuura,S.; 
Kawai,J.; Okazaki ,Y. ; Muramatsu,M. ; lnoue,Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cdna 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 4476) 

Adachi.J.; Aizawa,K.; Akimura,T.; Arakawa,T. ; Bono,H.; 
carninci,P.; Fukuda,S.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume ,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane.T.; Hori,F.; 
lmotani,K.; Ishii.Y.; ltoh,M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima.Y. ; Kondo,s.; Konno,H.; 
Kouda,M.; Koya,s.; Kunhara,c.; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K. ; Numazaki,R.; ohno,M.; Ohsato,N.; okazaki, Y. ; 
Saito,R.; Saitoh, H.; Sakai.C; Sakai,K.; Sakazume,N.; 
Sano,H.; Sasaki, D.; Shibata,K.; shinagawa, A. ; 
Shi raki ,T. ; Sogabe , Y . ; Tagami , M . ; Tagawa , A . ; 
Takahashi ,F. ; Takaku-Akahi ra,S. ; Takeda.Y.; Tanaka,T. ; 
Tomaru,A.; Toya,T.; Yasunishi ,A. ; Muramatsu ,M. ; 
Hayashizaki ,Y. 
Direct submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
Institute of Physical and chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences center (GSC), RIKEN Yokohama 



institute; 1-7-22 suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . i p/ , 
Tel : 81-45-503-9222 , Fax : 81-45-503-9216) 

FEATURES (FEAT): 

Feature Key Location Qualifier 

source 1..4476 /orqanism="Mus musculus" 

/mol-type= ,, mRNA" 
/strain="C57BL/6J" 
/db-xref="FANTOM-DB:D33001lG23" 
/db-xref="taxon : 10090" 
/clone= M D33001lG23" 
/tissue-type=" heart" 
/clone-lib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="13 days embryo 

CDS 228.. 2015 /note="unnamed protein product; 

putative weakly similar to 

SEMAPHORIN 6A PRECURSOR 

(SEMAPHORIN VIA) (SEMA VIA) 

(SEMAPHORIN Q) (SEMA Q) [MUS 

musculus] (SWISSPROT | 035464, 

evidence: FASTY, 60.6%ID, 

64.8%length, match=1719)" 

/codon-start=l 

/protei n-i d=" BAC34891 . 1" 

/db-xref="GI: 26342458" 

/transl ation="MGFLLLWFCVLFLLVSRLRA 

VS F P EDDE P LNTVDYH YS RQY PVF 

RGRPSGNESQHRLDFQLMLKIRDTLYIAGRDQVY 

TVN LNEI PQTEVI PSKKLTWRSRQ 

QDRENCAMKGKHKDECHNFIKVFVPRNDEMVFVC 

GTNAFN PMCRYYRLRTLEYDGEEI 

SGLARCPFDARQTNVALFADGKLYSATVADFLAS 

DAVIYRSMGDGSALRTIKYDSKWI 

KEPHFLHAIEYGNYVYFFFREIAVGHNNLGKAVY 

SRVARICKNDMGGSQRVLEKHWTS 

FLKARLNCSVPGDSFFYFDVLQSITDIIQINGIP 

TWGVFTTQLNSIPGSAVCAFSMD 

DIEKVFKGRFKEQKTPDSVWTAVPEDKVPKPRPG 

CCAKHGLAEAYKTSIDFPDDTLAF 

IKS H PLMDS AVP PI ADE PWFTKTRVR YR LTAI EV 

DRSAGPYQNYTVIFVGSEAGWLK 

VLAKTSPFSLNDSVLLEEIEAYN PAKCSAESEED 

RKWSLQLDKDHHALYVAFSSCW 

RIPLSRCERYGSCKKSCIASRDPYCGWLSQGVCE 

RVTLGMLPGGYEQDTEYGNTAHLG 

DCHGKTDASLASATPLDRICTQTLFYKQP" 

SEQUENCE (SEQ): 

1 ggccgccgcc gcagccgccg ccagggctgg agcgagtgag tttcaggata gatcgcaaga 
61 cgagacccga gaccgtggga ttgctctttc tgcacactgt cttgattggc tcttctgtct 
121 gaccgtgaca agcttcattt gtctgttttt aaaagcaggc ttccccatga gaggtagcta 
181 agtgggactt ctgaggaggg ggagctcaga tttcttcttg gccaaccatg gggttccttc 
241 tgctttggtt ctgcgtgctg ttccttctgg tctccaggtt acgggcggtc agcttcccag 
301 aagacgatga gcccctcaac acggttgact atcactattc aaggcaatat ccggttttta 
361 gaggacgccc ttcaggcaac gaatcgcagc acaggctgga ctttcagctg atgttgaaaa 
421 ttcgagacac actttatatt gctggcaggg atcaagtcta tacagtgaac ttaaatgaaa 
481 tcccccaaac agaggtgata ccaagcaaga agctgacgtg gaggtccaga cagcaggatc 
541 gagaaaattg tgctatgaaa ggcaagcata aagatgaatg ccacaacttc atcaaagtct 
601 ttgtcccaag aaatgatgag atggtttttg tctgtggtac caatgctttc aacccgatgt 
661 gcagatacta taggttgaga acgttagagt atgatgggga agaaattagt ggcctggcac 
721 gatgcccgtt tgatgcccga caaaccaatg tcgccctctt tgctgatgga aaactctatt 
781 ctgccacagt ggctgatttc ctggccagtg atgctgtcat ttacagaagc atgggagatg 
841 gatctgccct tcgcacaata aaatacgatt ccaagtggat caaagaacca cacttccttc 
901 atgccataga atatggaaac tatgtctatt tcttcttcag agaaatcgcc gtgggacata 
961 ataacttagg caaggctgtg tattcccgcg tggctcgcat ttgtaaaaac gacatgggtg 
1021 gctcacagcg ggtcctggag aaacactgga cttccttcct taaggctcgg ctgaactgct 
1081 ccgttcctgg agattccttt ttctacttcg acgtcctgca gtctataaca gacataatcc 
1141 aaatcaatgg catccccact gtggttgggg tcttcaccac acagctcaac agcattcctg 
1201 gttctgcagt ctgtgccttt agcatggacg acattgagaa agtgttcaaa gggcggttca 
1261 aagagcagaa aaccccagac tctgtttgga cagcagttcc cgaagacaaa gtaccaaaac 
1321 caaggcctgg ctgttgtgcc aaacacggcc tcgcagaagc ttacaagacc tccatcgact 
1381 ttccagatga caccctggct ttcatcaagt cccacccgct gatggactct gccgtcccac 



1441 ccattgccga tgagccctgg ttcacaaaga cacgggtcag gtacaggttg acagccatcg 

1501 aagtggaccg ttcagcaggg ccataccaaa actacacagt catctttgtt ggctctgaag 
1561 ctggcgtggt acttaaagtt ttggcaaaga ccagtccttt ctctctgaat gacagtgtat 

1621 tactcgaaga gattgaagct tataacccag ccaagtgcag cgccgagagt gaggaggaca 

1681 gaaaggtggt ctcattacag ctggacaagg atcaccatgc tttatacgtg gccttctcta 

1741 gctgcgtggt ccgcatcccc ctcagccgct gtgagcgcta cggatcgtgt aaaaagtctt 

1801 gcattgcatc acgtgacccg tactgtggtt ggttaagcca gggagtttgt gagagagtga 

1861 ccctagggat gctccctgga ggatatgagc aggacacgga gtacggcaac acagcccacc 

1921 taggggactg ccacggtaag acagacgcct ccctggcctc tgctactcct ttggaccgca 

1981 tttgcacgca aacattattt tacaagcagc cttgataatg actcgggaca catagcacct 

2041 agcatttaac gtaaacttgg aaaaagctga gctctttggt catggggtcc attagaagag 

2101 attctgactt ttttcttttc cttcgccatt tgcacttttg taaataaaca acagtctaga 

2161 gaaaggtagt gtaaccagta aacttctagt ttgtagtctt ttctttggac ttttctctgt 

2221 tacttggagt tcgtgaactt ttttgctttc tttccgttac tcttcactga tgacgtgttc 

2281 ctttaatccc tttagaaagt ttgcctcctt caactacacc agattacaaa atatttggcg 

2341 gtccaacatc tggttagttt tttttaattt ttttgaatta acagtctttt ctttccttca 

2401 gtttagcatt ttctgccctc ttccccaccc ccctcctttt gcgcagtttg gcagccctcc 

2461 atttttctgc tttgactcat aaccctctga aggtcctgag agatgttttc ttactcagcc 

2521 ctccacccgt gtgtagcatg gaatggttca tcattgacca aggcacatcc ttaaggaaag 

2581 tccaatgggg tgacatattc tgcatttcaa ttgcacgcca gccacccagc tcccttctct 

2641 cgaccctttg cattgtgaat gctccctctg gccatctcta tcccatggat gacagaattg 

2701 cataattttc cccttctctc cccacctttc atacaaacaa ggaatgaaca atacacattt 

2761 ttcaaaaatg tgtattttta agttctaaag atcttttttt gaatgttttt gtggcaatcc 

2821 caggaatctc tgaatgttgt gttctcagtg gtgtgaaaga agctagcttc agttgaatga 

2881 agctggattt tgattttcta agatgtaccg tgcacatcta tctgtgtttg tttcgaggtc 

2941 tcttcctgta actattaggt cagtccgtac catgtctaaa agcgtatgct tatatattca 

3001 tggacatcta ctaggctaaa gctcgcaata gacacggaaa ggttcagttg ctcctcagac 

3061 tatgctgcat taggggtcac atcagttcca tctctcaggg gggagaaaag gatggcctgt 

3121 ctcatgtatc ccaacatcct tattcagtgt atacatacca ttgcttgtgc atgggttact 

3181 gctaactctt tggaagaaac tccccaagta tttcggagaa cacagcaagg tttgtggatg 

3241 gagtctcacg gtgcattgag aaacctacag atggacagag aggccttggg gaagcagaat 

3301 ggcggcctct taggcagaac aggtttatca ggtccaagac caactgttta cttggacctg 

3361 accagatatt ttctaaatgt aattccatta tagcttctcc ggggaaggaa agcgtataga 

3421 gaccccagcc gttcacaaag gtcctctgta cattacacat cacaaataaa tgatcccgag 

3481 agctagctgg agagagccag agtctgccca agttggggct taggatactt ttccacgagg 

3541 aggcttcatt aggaatgttc acctgtcttt tactttggaa atctgcataa tgtgtcttca 

3601 ttgtcagctt ttagatttta cacctctctt gcaagaagaa aaaaaaaatc tggtttcgtt 

3661 cccctgaggg ggcttttacc aaattctagg acaggatttc tgagaatacg tacttgactt 

3721 tcaaatgcaa ggatgcaatc agaaaagtct tcttgacgct agccatgtta atggtcatct 

3781 gtttgccacc acagacatgg aggtatcctc atcttctgtt accactgtgg caagtagccc 

3841 agaaattaca tctaaagtga ttgatacctg gagacctaaa ctgacgagct cccggaaatt 

3901 tgtagttcaa gatgacccaa atacttctga ttttactgat actatatcag gtatcccaaa 

3961 gggtaaggcc tggccaggca cttcatctct gactgcatgg aaacctgatc cctagcctgg 

4021 ctctcagagg tgaaactaaa gtcgagcagc agtccctctg gccacagtga ctttgctctc 

4081 ctgtggtttt taagtctgag tacattcctg cccacgacat ggctcctgag acagccactc 

4141 agacaatcta atctgtattg ggttctgctg gcagatgctg agagcctgct gaatatcaga 

4201 ggattctgag ggtcaccctc tcaatatcaa aggacaaaag ctcgcagatg acaaatacct 

4261 gttaccaaat atttttatct gggaggctat ctcagcccct tcttagcttc atctgcagat 

4321 gtaatgtgga aaggatatgc agggtgacaa gttgtccaaa tgccacaagt ggcaaagggt 

4381 ggcatagtac ttgtctgtaa ttactacgtt cagcccataa taggtgaagt ttaaaagctg 

4441 tatgatcaat ttgcttgtaa gatttaatac tctagg 
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locus (loc): AK042751 GenBank (R) 

GenBank ACC. NO. (GBN) : AK042751 

GenBank VERSION (VER) : AK042751.1 GI:26335300 

CAS REGISTRY NO. (RN) : 492901-03-8 

SEQUENCE LENGTH (SQL): 3226 

MOLECULE type (ci) : mRNA; linear 

DIVISION CODE (CI): Hi gh-Throughput CDNA Sequencing 
DATE (DATE): 3 Apr 2004 

DEFINITION (DEF): Mus musculus 7 days neonate cerebellum cdna, riken 

full-length enriched library, clone:A730020P05 
product:sema domain, transmembrane domain (tm) , and 
cytoplasmic domain, ( ***semaphorin*** ) ***6a*** 
, full insert sequence. 

keywords (ST): HTC; CAP trapper 

SOURCE: Mus musculus (house mouse) 

ORGANISM (ORGN): Mus musculus 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 



Genomic sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . jp/ 
URL : http : //f antom . gsc . ri ken . jp/ . 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 
OTHER SOURCE (OS) : 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO) 



carninci,p.; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; Shibata,Y.; Hayatsu,N.; Sugahara,Y.; 
Shibata,K.; ltoh,M.; Konno,H.; Okazaki ,Y.; 
Muramatsu.M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

shibata,K.; ltoh,M.; Aizawa.K.; Nagaoka.s.; Sasaki, N.; 
carninci,P.; Konno,H.; Akiyama,j.; Nishi,K.; 
Kitsunai,T.; Tashiro,H.; ltoh,M.; sumi,N.; lshii,Y.; 
Nakamura.S.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto,R.; Matsumoto,H. ; Sakaguchi ,S. ; Ikegami,T.; 
Kashiwagi ,K. ; Fujiwake,S.; lnoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y. ; 
Ishikawa,T.; ozawa,K.; Tanaka,T. ; Matsuura,s.; 
Kawai,J.; okazaki ,Y. ; Muramatsu,M. ; Inoue,Y.; Kira,A.; 
Hayashizaki , Y. 

riken integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3226) 

Adachi,J.; Aizawa,K.; Akimura,T. ; Arakawa.T. 
Carninci,P.; Fukuda,s.; Furuno,M.; Hanagaki,T. 
Hara,A.; Hashizume,w. ; Hayashida,K. ; Hayatsu.N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane.T.; Hori,F.; 
lmotani,K.; lshii,Y.; Itoh,M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima,Y.; Kondo.s.; Konno,H.; 
Kouda.M.; Koya,s.; Kunhara.C. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura,K.; Numazaki,R.; ohno.M.; ohsato,N.; okazaki ,Y. 
Saito,R.; Saitoh, H.; Sakai,C; Sakai,K.; sakazume,N.; 
Sano,H.; Sasaki, D.; shibata,K.; shinagawa,A. ; 
Shiraki,T.; sogabe,Y.; Tagami,M.; Tagawa,A. : 
Takahashi ,F. ; Takaku-Akahi ra,S . ; 
Tomaru,A.; Toya,T. ; Yasunishi.A. 
Hayashizaki , Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, riken 
Genomic Sciences Center (GSC), RIKEN Yokohama 
institute; 1-7-22 suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc. riken. jp, URL : http: //genome. gsc. riken. jp/, 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



Bono,H. 



Takeda , Y . ; Tanaka , T . 
; Muramatsu,M. ; 



FEATURES (FEAT) 



Feature Key Location 
source 1..3226 



CDS 



671. .2488 



Qualifier 

/organism="Mus musculus" 

/mol-type="mRNA n 

/strain= M C57BL/6J n 

/db-xref= M FANTOM-DB:A730020P05" 

/db-xref= M taxon: 10090" 

/clone="A730020P05" 

/tissue-type="cereben um M 

/ cl one-1 i b="RIKEN full-length 

enriched mouse cDNA library" 

/dev-stage="7 days neonate 

/not e=" unnamed protein product; 

putative sema domain, 

transmembrane domain (TM) , and 

cytoplasmic domain, (semaphorin) 

6A (MGD I MGI: 1203727, GB | NM-018744, 

evidence: BLASTN, 100%, 

match=1802)" 

/codon-start=l 

/protein-id="BAC31351.1" 

/db-xref= n Gl: 26335301" 

/translation="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFN PSCRNYRVDTLETFGDEFS 

GMARC P YDAKH AN IALFADGK LYSATVTD F LAI D 

AVIYRS LGDS PTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTS F 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

VLATFSTPYNSIPGSAVCAYDMLD 

IANVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTH P LMDE AV PS 1 1 N R PWF L RTMVR YR LTK I AVD 

NAAGPYQNHTWFLGSEKGIILKF 

LARIGSSGFLNGS LFLEEMNVYN PEKCSYDGVED 

KRIMGMQLDRASGS LYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWVRESGSC 

AHLSPLSRLTFEQDIERGNTDGLG 

DCHNSFVALNDISTPLPDYEMSFNRVYGQFIAIY 

IYINT" 



SEQUENCE (SEQ) : 

1 gactcggctg cctaggcgct cgggacgcag cacaggcgct tttccgcggt gccgattgcc 

61 cgagatgccc gcctaggagc gtgcggccgc ggccagcgtc gccacacccg ggacactgca 

121 ctcggggttg cagagccggc cagagccctg cacctctgcc tcaacccccc acaccgcccc 

181 ccgccctgaa atgactagtt aatgggcgct ggcaccgccg aggggactca gaagtgaatc 

241 caagtggaat ttggatttgg aaaaagagtt tcttgagcat ttaccctggt cctcgctggt 

301 tttcttcttt tttctgtttt ttcccttttt tctttttctt ttttcagtta ttttatttta 

361 ctttttattt ttatttttta tttttggtct tcctctcacc tcctcttctg tcattggaga 

421 tgaacacagc tcgcttgcat cccagaaagt aatcgccgtg actgttgccc ccaaagagac 

481 aagcacacat gtaggaatga caaaggcttg cgaaggaaag agcgcagctc aaggcccaga 

541 gagatcttct cgataatgga ttactaaatg ggatactcgc cgtgtcagtc cgctccgagc 

601 cgcggccgcc tgcccgtcga tgcaccgaaa agggtgaagt agagaagcaa gtccccccgc 

661 tgaacctact atgcggccag cagccttact gctgtgtctc acactgctac actgcgctgg 

721 ggctggtttc ccagaagatt ccgagccaat cagtatttcg catggcaact atacaaaaca 

781 gtatccggtg tttgtgggcc acaagccagg acggaacacc acgcagaggc acaggctgga 

841 catccagatg atcatgatca tgaacagaac cctctacgtt gctgctcgag accatattta 

901 tactgttgat atagacacat cccacacaga agaaatttac tgtagcaaaa aactgacatg 

961 gaaatctaga caggctgacg tagacacatg caggatgaag gggaaacata aggatgaatg 

1021 tcacaacttc attaaagttc ttctcaagaa gaatgatgat acgctgtttg tctgtggaac 

1081 caatgccttc aacccttcct gcagaaacta cagggtcgat accttggaaa cttttgggga 

1141 tgaatttagc ggaatggcca gatgccctta tgatgccaaa catgccaaca tcgctctgtt 

1201 tgcagatgga aaactctact cggctacagt gactgacttt ctggccattg atgcagtcat 

1261 ttacaggagc ctcggagaca gccctaccct caggactgtc aagcatgatt caaagtggtt 

1321 gaaagagccg tactttgtcc aagccgtgga ttatggggac tatatctact tcttcttcag 

1381 agaaattgca gtagaataca acactatggg gaaggttgtt ttccctaggg tggctcaggt 

1441 ctgtaagaat gacatgggag ggtctcagag agtcctggag aagcagtgga catctttcct 

1501 gaaggctcgc ctgaactgct cggtgcctgg agactctcat ttttatttca atatactcca 

1561 ggcagttaca gatgtgattc gcattaatgg ccgtgatgtt gtcttggcaa ccttttccac 

1621 accttataac agcatcccag gttctgcagt ctgtgcctat gacatgcttg acattgctaa 

1681 tgttttcact gggaggttca aggaacagaa atcacctgac tctacctgga cacccgttcc 

1741 agacgaacga gtccctaagc ccaggccagg ctgttgtgct ggatcatcct ctttagaaaa 



1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 



atatgcaacc 
catggacgag 
ataccgcctg 
ggttttcctg 
tggtttccta 
cagctatgat 
ctcactctat 
acatgggaag 
ggaaagtggt 
tgagcgtggc 
catttcaact 
tgctatatat 
attcttaatc 
ttagatttta 
aagtcttaca 
tctatctaat 
agatttagga 
gaataccaac 
taacagcaca 
caaaaacatt 
tcagaatgta 
attgttctcc 
ctcacgtgca 
ctattaattt 



tccaatgagt 
gcagtgcctt 
accaaaattg 
ggatcagaaa 
aatggcagcc 
ggtgtagaag 
gttgcattct 
tgtaaaaaaa 
tcctgtgccc 
aatacggacg 
cctctaccag 
atatatatta 
atagaacaga 
ttatacttaa 
tatttaatcc 
gaattttaga 
taaaagaaca 
cccctaatcc 
gacgccgggg 
caagttcaga 
tgcaaggaag 
acctgaaata 
agccagaatg 
aaaggattga 



ttcccgatga 
ccatcatcaa 
cagtagacaa 
agggaatcat 
ttttcctgga 
acaaaaggat 
ctacttgtgt 
cctgcatcgc 
atctgtcacc 
gcctaggaga 
attatgaaat 
atacttaaag 
ttctttgtta 
aaaaaataca 
atatactatg 
aggagaagaa 
gtgcttataa 
ggcctgtcag 
ttttgtgagg 
aagaatttga 
catgaggcag 
cacaagcaca 
cagctgtttt 
atgtattttt 



taecctgaac 
cagaccttgg 
cgctgccggg 
cctgaagttc 
ggagatgaat 
catgggcatg 
gatcaaggtg 
ctccagagac 
ccttagcaga 
ctgtcacaat 
gtctttcaac 
ttactcatcc 
cttcagggcc 
ttctgcttac 
aaacatgcca 
atttcttctt 
cagataaact 
gacaacctct 
caagcatgtg 
atatttaaac 
tgagtgtaag 
gatccaagtt 
ggttttggtt 
taaaaaagaa 



ttcattaaga 
ttcctgagaa 
ccatatcaga 
ttggccagga 
gtttacaacc 
cagctcgaca 
cctcttggcc 
ccgtattgtg 
ctgacatttg 
tccttcgtgg 
agagtgtatg 
tttggtagtt 
ccaattcact 
atgagacagt 
aagttcaagt 
gaatatgcct 
gtagcccctc 
ccaacaggga 
ctgtgtagca 
atatttccag 
atccaaacag 
cagggggtta 
ttcctagcac 
agaaag 



cgcatccact 
caatggtcag 
atcacactgt 
taggaagcag 
cagaaaagtg 
gagcgagtgg 
gctgtgagcg 
ggtgggtaag 
agcaggacat 
cactgaatga 
gtcagtttat 
tttagttttc 
aaacttaatt 
taaggagacc 
attgggtaaa 
gtatatagag 
ggctgatagg 
aattcccata 
tgaaaatggc 
taagacttct 
gattttgccc 
tccttacttc 
ttccaaacac 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



AK084922 
AK084922 

AK084922.1 Gl:26102335 
492839-41-5 
3921 

mRNA ; linear 

High-Throughput cdna Sequencing 
3 Apr 2004 

Mus musculus 13 days embryo lung cDNA, RIKEN 
full-length enriched library, clone :D430012F08 
product: weakly similar to ***SEMAPHORIN*** 

***6A*** PRECURSOR ( ***SEMAPHORIN*** ***VIA 
) (SEMA VIA) (SEMAPHORIN Q) (SEMA Q) [Mus musculus], 
full insert sequence. 
HTC; cap trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
URL : http : //f antom . gsc . ri ken . jp/ . 



KEYWORDS (ST) : 
SOURCE: 
ORGANISM (ORGN) 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 
OTHER SOURCE (OS) 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO): 
OTHER SOURCE (OS): 
REFERENCE: 

AUTHOR (AU) : 



carninci,P.; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; Shibata,Y.; Hayatsu,N.; Sugahara.Y.; 
Shibata,K.; Itoh,M.; Konno,H.; okazaki.Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

shibata,K.; ltoh,M.; Aizawa,K.; Nagaoka,s.; Sasaki, N. 
carninci,P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai.T.; Tashiro,H.; ltoh,M.; sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A.; 
Yamamoto,R. ; Matsumoto,H. ; sakaguchi ,s. ; Ikegami ,T. ; 



TITLE (TI) 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO) : 



Kashiwagi ,K. ; Fujiwake,S.; Inoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki ,M. ; Yoneda,Y.; 
ishikawa.T.; ozawa,K.; Tanaka,T.; Matsuura,S . ; 
Kawai,j.; okazaki ,Y. ; Muramatsu ,M. ; inoue.Y.; Kira.A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3921) 

Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa,T.; Bono,H.; 
Carninci,P.; Fukuda,S.; Furuno,M.; Hanagaki,T.; 
Hara,A_; Hashizume,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
lmotani,K.; lshii,Y.; ltoh,M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima,Y. ; Kondo,s.; Konno.H.; 
Kouda,M.; Koya,s.; Kunhara,c. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K.; Numazaki,R.; Ohno,M.; Ohsato,N.; okazaki, Y. ; 
Saito,R.; Saitoh, H.; Sakai.C; Sakai,K.; Sakazume,N.; 
Sano,H.; Sasaki, D.; Shibata,K.; shinagawa,A. ; 
shiraki,T.; Sogabe,Y.; Tagami,M.; Tagawa,A.; 
Takahashi ,F. ; Takaku-Akahi ra,s . ; Takeda,Y.; Tanaka,T.; 
Tomaru,A.; Toya,T.; Yasunishi , A, ; Muramatsu, M. ; 
Hayashizaki , Y. 
Direct Submission 

Submitted (16-APR-2002) Yoshihide Hayashizaki, The 
institute of Physical and chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- 
res@gsc. riken.jp, URL: http://genome.gsc. riken.jp/, 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT) 
Feature Key 



source 



Location 
.3921 



misc-feature 



1. .3921 



SEQUENCE (SEQ): 

1 gagtttcttc 
61 cttcctcgct 
121 ccagcttcgg 
181 gcggggaccg 
241 gctttgttct 



Qualifier 



/orqanism= M Mus musculus" 
/mol-type= ,, mRNA M 
/strain= ,, c57BL/6J ,, 
/db-xref="FANTOM-DB:D430012F08" 
/db-xref= M taxon: 10090" 
/clone= M D430012F08" 
/tissue-type= f, lung" 
/clone-lib="RlKEN full-length 
enriched mouse cDNA library" 
/dev-stage="13 days embryo ' 
/note="weakly similar to 
SEMAPHORIN 6A PRECURSOR 
(SEMAPHORIN VIA) (SEMA VIA) 
(SEMAPHORIN Q) (SEMA Q) [MUS 

musculus] (SWISSPROT | 035464, 
evidence: FASTY, 60.6%ID, 
64.8%length, match=1719)" 



ttggcagggc 
gcggggcttg 
ccgggagaac 
tgagcaccca 
ctggctgaaa 



cggctcggcg 
gagcctccag 
gcagggacag 
ctcgcgggtc 
aaatgctcca 



ccgctccgca 
cccgcccgca 
cctgggagtg 
tcccagccac 
cagaagcgcc 



agccgtcagg 
gcgcagcccg 
cggagccact 
tctcccccgc 
atagaccggg 



gtgagcccgc 
ggccgccggg 
gactgtccac 
aactttgcgt 
tctcctccag 



301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 



gcggcaaggg 
aaacaaataa 
gtagcctgga 
tgaggagggg 
tgcgtgctgt 
cccctcaaca 
tcaggcaacg 
ctttatattg 
gaggtgatac 
gctatgaaag 
aatgatgaga 
aggttgagaa 
gatgcccgac 
gctgatttcc 
cgcacaataa 
tatggaaact 
aaggctgtgt 
gtcctggaga 
gattcctttt 
atccccactg 
tgtgccttta 
accccagact 
tgttgtgcca 
accctggctt 
gagccctggt 
tcagcagggc 
cttaaagttt 
attgaagctt 
tcattacagc 
cgcatccccc 
cgtgacccgt 
ctccctggag 
cacggtaaga 
acattatttt 
taaacttgga 
tttcttttcc 
taaccagtaa 
cgtgaacttt 
ttagaaagtt 
ggttagtttt 
tctgccctct 
ttgactcata 
tgtagcatgg 
gacatattct 
attgtgaatg 
ccttctctcc 
gtatttttaa 
gaatgttgtg 
gattttctaa 
ctattaggtc 
taggctaaag 
aggggtcaca 
caacatcctt 
ggaagaaact 
tgcattgaga 
aggcagaaca 
tctaaatgta 
ttcacaaagg 
gagagccaga 
ggaatgttca 
tagattttac 



ctcggccggg 
ggaaaagtca 
gcagaggcag 
gagctcagat 
tccttctggt 
cggttgacta 
aatcgcagca 
ctggcaggga 
caagcaagaa 
gcaagcataa 
tggtttttgt 
cgttagagta 
aaaccaatgt 
tggccagtga 
aatacgattc 
atgtctattt 
attcccgcgt 
aacactggac 
tctacttcga 
tggttggggt 
gcatggacga 
ctgtttggac 
aacacggcct 
tcatcaagtc 
tcacaaagac 
cataccaaaa 
tggcaaagac 
ataacccagc 
tggacaagga 
tcagccgctg 
actgtggttg 
gatatgagca 
cagacgcctc 
acaagcagcc 
aaaagctgag 
ttcgccattt 
acttctagtt 
tttgctttct 
tgcctccttc 
ttttaatttt 
tccccacccc 
accctctgaa 
aatggttcat 
gcatttcaat 
ctccctctgg 
ccacctttca 
gttctaaaga 
ttctcagtgg 
gatgtaccgt 
agtccgtacc 
ctcgcaatag 
tcagttccat 
attcagtgta 
ccccaagtat 
aacctacaga 
ggtttatcag 
attccattat 
tcctctgtac 
gtctgcccaa 
cctgtctttt 
acctctcttg 



agacgtgtta 
attttgtatg 
gcaaaccgag 
ttcttcttgg 
ctccaggtta 
tcactattca 
caggctggac 
tcaagtctat 
gctgacgtgg 
agatgaatgc 
ctgtggtacc 
tgatggggaa 
cgccctcttt 
tgctgtcatt 
caagtggatc 
cttcttcaga 
ggctcgcatt 
ttccttcctt 
cgtcctgcag 
cttcaccaca 
cattgagaaa 
agcagttccc 
cgcagaagct 
ccacccgctg 
acgggtcagg 
ctacacagtc 
cagtcctttc 
caagtgcagc 
tcaccatgct 
tgagcgctac 
gttaagccag 
ggacacggag 
cctggcctct 
ttgataatga 
ctctttggtc 
gcacttttgt 
tgtagtcttt 
ttccgttact 
aactacacca 
tttgaattaa 
cctccttttg 
ggtcctgaga 
cattgaccaa 
tgcacgccag 
ccatctctat 
tacaaacaag 
tctttttttg 
tgtgaaagaa 
gcacatctat 
atgtctaaaa 
acacggaaag 
ctctcagggg 
tacataccat 
ttcggagaac 
tggacagaga 
gtccaagacc 
agcttctccg 
attacacatc 
gttggggctt 
actttggaaa 



cgctctcagc 
agccgcgtct 
cccaacaacc 
ccaaccatgg 
cgggcggtca 
aggcaatatc 
tttcagctga 
acagtgaact 
aggtccagac 
cacaacttca 
aatgctttca 
gaaattagtg 
gctgatggaa 
tacagaagca 
aaagaaccac 
gaaatcgccg 
tgtaaaaacg 
aaggctcggc 
tctataacag 
cagctcaaca 
gtgttcaaag 
gaagacaaag 
tacaagacct 
atggactctg 
tacaggttga 
atctttgttg 
tctctgaatg 
gccgagagtg 
ttatacgtgg 
ggatcgtgta 
ggagtttgtg 
tacggcaaca 
gctactcctt 
ctcgggacac 
atggggtcca 
aaataaacaa 
tctttggact 
cttcactgat 
gattacaaaa 
cagtcttttc 
cgcagtttgg 
gatgttttct 
ggcacatcct 
ccacccagct 
cccatggatg 
gaatgaacaa 
aatgtttttg 
gctagcttca 
ctgtgtttgt 
gcgtatgctt 
gttcagttgc 
ggagaaaagg 
tgcttgtgca 
acagcaaggt 
ggccttgggg 
aactgtttac 
gggaaggaaa 
acaaataaat 
aggatacttt 
tctgcataat 



tccccatcaa 
ttcgcagcca 
aggtagctaa 
ggttccttct 
gcttcccaga 
cggtttttag 
tgttgaaaat 
taaatgaaat 
agcaggatcg 
tcaaagtctt 
acccgatgtg 
gcctggcacg 
aactctattc 
tgggagatgg 
acttccttca 
tggaacataa 
acatgggtgg 
tgaactgctc 
acataatcca 
gcattcctgg 
ggcggttcaa 
taccaaaacc 
ccatcgactt 
ccgtcccacc 
cagccatcga 
gctctgaagc 
acagtgtatt 
aggaggacag 
ccttctctag 
aaaagtcttg 
agagagtgac 
cagcccacct 
tggaccgcat 
atagcaccta 
ttagaagaga 
cagtctagag 
tttctctgtt 
gacgtgttcc 
tatttggcgg 
tttccttcag 
cagccctcca 
tactcagccc 
taaggaaagt 
cccttctctc 
acagaattgc 
tacacatttt 
tggcaatccc 
gttgaatgaa 
ttcgaggtct 
atatattcat 
tcctcagact 
atggcctgtc 
tgggttactg 
ttgtggatgg 
aagcagaatg 
ttggacctga 
gcgtatagag 
gatcccgaga 
tccacgagga 
gtgtcttcat 



ttatggatgg 
gaggcgtctt 
gtgggacttc 
gctttggttc 
agacgatgag 
aggacgccct 
tcgagacaca 
cccccaaaca 
agaaaattgt 
tgtcccaaga 
cagatactat 
atgcccgttt 
tgccacagtg 
atctgccctt 
tgccatagaa 
taacttaggc 
ctcacagcgg 
cgttcctgga 
aatcaatggc 
ttctgcagtc 
agagcagaaa 
aaggcctggc 
tccagatgac 
cattgccgat 
agtggaccgt 
tggcgtggta 
actcgaagag 
aaaggtggtc 
ctgcgtggtc 
cattgcatca 
cctagggatg 
aggggactgc 
ttgcacgcaa 
gcatttaacg 
ttctgacttt 
aaaggtagtg 
acttggagtt 
tttaatccct 
tccaacatct 
tttagcattt 
tttttctgct 
tccacccgtg 
ccaatggggt 
gaccctttgc 
ataattttcc 
tcaaaaatgt 
aggaatctct 
gctggatttt 
cttcctgtaa 
ggacatctac 
atgctgcatt 
tcatgtatcc 
ctaactcttt 
agtctcacgg 
gcggcctctt 
ccagatattt 
accccagccg 
gctagctgga 
ggcttcatta 
tgtcagcttt 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) : 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AK084029 GenBank (R) 
AK084029 

AK084029 . 1 GI : 26101651 

492832-57-2 

3105 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 12 days embryo spinal ganglion cDNA, RIKEN 
full-length enriched library, clone:Dl30077F24 
product: weakly similar to ***semaphorin*** 

***6A*** PRECURSOR ( ***SEMAPHORIN*** *** VIA *** 



KEYWORDS (ST) : 
SOURCE: 
ORGANISM (ORGN) 



) (SEMA VIA) (SEMAPHORIN Q) (SEMA Q) [MUS musculus], 
full insert sequence. 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences center and Genome science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . r i ken . j p/ 
URL : http : //f antom . gsc . ri ken . jp/ - 



(os); 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 
OTHER SOURCE 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI) 



JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



Carninci , P. ; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol . , 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci, P.; Shibata,Y.; Hayatsu,N.; sugahara,Y. ; 
Shibata,K.; Itoh,M.; Konno,H.; okazaki,Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; ltoh,M.; Aizawa,K.; Nagaoka.S.; Sasaki f N. 
Carninci, P.; Konno,H.; Akiyama,J.; Nishi.K.; 
Kitsunai.T.; Tashiro,H.; ltoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto , R . ; Matsumoto ,H. ; Sakaguchi , s . ; Ikegami , T . ; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K.; Togawa,Y. ; 
Izawa,M.; Ohara,E.; Watahiki,M.; Yoneda,Y. ; 
Ishikawa.T.; 0zawa,K.; Tanaka,T. ; Matsuura,s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; lnoue,Y.; Kira,A. 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multicapillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on function 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3105) 

Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa,T.; Bono,H. 
Carninci, P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara.A.; Hashizume ,W. ; Hayashida.K. ; Hayatsu,N.; 
Hiramoto.K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
Imotani.K.; ishii.Y.; Itoh,M.; Kagawa,I.; Kasukawa,T. 
Katoh,H.; Kawai,J.; Ko;jima,Y.; Kondo,S.; Konno,H.; 
Kouda,M.; Koya,S.; Kunhara,c.; Matsuyama,T. ; 
Miyazaki.A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura,K.; Numazaki,R.; ohno,M.; Ohsato,N.; Okazaki,Y 
Saito,R.; Saitoh, H.; Sakai,C; Sakai,K.; Sakazume,N.; 
Sano,H.; Sasaki, D. ; shibata,K.; Shinagawa,A. ; 
Shiraki,T.; Sogabe,Y.; Tagami,M,; Tagawa,A.; 
Takahashi ,F. ; Takaku-Akahi ra,s. ; Takeda,Y.; Tanaka,T. 
Tomaru,A.; Toya^T. ; Yasunishi ,A. ; Muramatsu ,M. ; 
Hayashizaki ,Y. 



TITLE (TI): 
JOURNAL (SO): 



Direct submission 

Submitted (16-APR-2002) Yoshihide Hayashizaki, The 
Institute of Physical and Chemical Research (RIKEN) , 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc. riken.jp, URL: http://genome.gsc. riken.jp/, 
Tel =81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



1..3105 



misc-feature 



<1. .646 



polyA-signal 
polyA-site 



3086. .3091 
3105 



/orqanism="Mus musculus" 

/moT-type="mRNA" 

/strain= M C57BL/6J" 

/ db-xref="FANTOM-DB : D130077F24" 

/db-xref="taxon: 10090" 

/clone="Dl30077F24" 

/ti ssue-type="spi nal gang! ion" 

/clone-lib="RlKEN full-length 

enriched mouse cdna library" 

/dev-stage="12 days embryo" 

/note="putative weakly similar to 

SEMAPHORIN 6A PRECURSOR 

(SEMAPHORIN VIA) (SEMA VIA) 

(SEMAPHORIN Q) (SEMA Q) [MUS 

musculus] (SWISSPROT 1 035464, 

evidence: FASTY, 60.6%ID, 

64.8%length, match=1719)" 

/note="putative" 

/note="putative" 



SEQUENCE (SEQ): 

1 tcttacgaag tcatccagta agcgggagca ccggcggtct gtggattcca ggaatactct 

61 caatgatctc ctgaagcatc taaatgaccc aaacagtaac cccaaagcca tcctgggaga 

121 gatccatatg gctcatcaaa ccctcatgct ggacccggtg ggaccaatgg ctgaggtccc 

181 acccaaggtc cctaaccggt aggcatctct atactcccct ccctccacac tccccagaaa 

241 tagtccaacc aagagagtag atgtccccac cactcctggg gtgccaatga cttctctgga 

301 aagacaaagg ggttatcaca aaaattcctc ccagaggcac tctatatctg ccgtgcctaa 

361 aaacttaaac tcaccaaatg gtgttttgtt atctagacag ccgagtatga accgtggagg 

421 ctatatgccc accccaacag gggcgaaggt ggactatatt caggggacac cggtgagtgt 

481 tcatctgcag ccctccctct ccagacagag cagctatacc agtaatggca ccctccccag 

541 gacgggacta aagaggacac catccttaaa acctgatgtg ccaccaaagc cttcctttgt 

601 tccgcaaacc acatctgtca gaccactgaa caagtacacg tactaggcct caagtatgct 

661 attcccgtgt ggctttatcc tgtccctgct gttgcgagga agctgtgagg gtaccttcag 

721 atgagatacc tgcttgtatt ttaagagaaa caagtagcca aagaaactct gtcactttgg 

781 taacaccaga acttgccaca tgtagctact acagcaaggc ttctgtgtac ttgccggaaa 

841 cgaagggagg tcctgctcac tccatttctt tcgtttgaag cagaagggat gtgtagccag 

901 ggaaggctcc cttcaccagt gtaaagagct gatacagtac tcagaagact gaacaaatac 

961 ttgaaaatgg gttcaatgta gactgccatt ctgtgtggtc ttcccattaa atgtgaacat 

1021 tttaatatgt atgcattcac cttgcctctt gcacaaatgt caaaatggaa agatgggaat 

1081 gtctcaaaac aaaatgagct tggagattac caagcagttt gctgaaaatt caatctttga 

1141 cccaaactgt agcaattttt attttctgag tgtggcactg ttttgtttta tttttttgtt 

1201 ttgttttgtt tttcaatgcc accaacaaac tatgttaaga gaggggcgag tgtgtgtgtg 

1261 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg agagagagag agagagagag agagagagag 

1321 agagagagat ctacccatta gaattgattt aggtgcccat tgcatctttt gtactatgga 

1381 gttgtttaca gtaagcatga ctgaacaagc actaacatcc tcctacaccg gtccatcgtg 

1441 ttcggctctg atcaggaata gacaaggctt ctttgactgt caagtgatta acatatggta 

1501 cccggtgtca gaacatttaa gcgttctaaa taattttgtg ctgctatcca tcagaacgtc 

1561 tattccagcc ttgccagctt aagaacttag agaaattaag gaggtatctt tgatgtatct 

1621 accagtattc aatagtaaca tttacctgtc cactgtgaat caaagcctgg ggcacgctac 

1681 tcaaccttag acacacttac aggcttttat tttgattgtg ttattttatt ttccatacag 

1741 taaaaaatgt cctctcttaa ctcctctcac ccccaactcc caaggcaaaa gagaaaccaa 

1801 aagtaggaca aaagaaagag acagacagac agacagacag aacaggaatt acagttggct 

1861 tagcagggag gatctgtaga gctctcgatg atccttcctc catgtgtgtc ctgtgaggat 

1921 tttgctcatg agtcatgcag gcagtgacag tgaactgtaa ataccacatg tgagtttacc 

1981 tggatctgtg cattttgtgc cttattcact cgagtggtag aagttccgcg gtatgttgag 

2041 tgtttccact acagagcatc atttcccatg tgccacttgt aaatgttttg cagccagcac 

2101 ctgaggactt cttatacagt cataaagcca cctagagcga tttgttgttg agcaacgccg 

2161 ccccttccct ccaggatcag aagcagcacc cacccttgcc atgataaaca ttccatcccc 

2221 tgctgttctg ataatgtgaa gtcagatgag ggttcccagg ttatgtggca cctgcggaaa 

2281 ccatgtctag agtcgtttct atgtagcttg gcagagccca ctgatggctg caggtgtgta 

2341 gcctactgac agctttggtt aacccactgc acatcaccca gtcatttacc caacttactc 

2401 tagaaatgtc cacagctaag gaaggctcct gagccagtca ggcggagttc aagtgatatc 

2461 ttgggacagt gaccatggtc tgcattactt gggggaggat ggggtacaag acaccagatc 



2521 atttttatac 
2581 tctatagaaa 
2641 ttgcaactaa 
2701 catctgccac 
2761 ctctgtgtga 
2821 ttctatcagt 
2881 aattttaaat 
2941 tacggcataa 
3001 aacgtgtata 
3061 tttttaacac 



aggatgtaga gtactgatgc 
aatgattccc tgtgatcttc 
aaatattaca gtttaataat 
caacaatgtt gtttgcattt 
ccccgataac attttgtaac 
atttatgttg aaatttctaa 
gctttatcca tttgtgcaaa 
aaaaataacc ctaaagtgaa 
gatatcttgt aaacttgtat 
cgcatgtaaa gtcaaaataa 



ggttgatctt ttccttcaag 
tggaagctcc aaagctgaaa 
caattaaacc aaccaacaat 
accttaccaa tattaatccc 
attgtgctgc cttagagttt 
catggattct agtctctatt 
ggtaaacaca gattgtatct 
gtggctctat actgttttat 
tgtggatgtg taaataatat 
aatatccaag tcatt 



aacattcttt 
cccttcagct 
aagcactaca 
agcgtggtaa 
gtactgtgag 
ctgttaattt 
tttttaatgg 
agagtacttt 
gtactttggg 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF): 



KEYWORDS (ST) : 
SOURCE: 
ORGANISM (ORGN) : 



GENBANK . RTM . COPYRIGHT 2004 on STN 

AK078490 GenBank (R) 
AK078490 

AK078490.1 GI: 26097898 

492822-90-9 

3509 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 12 days embryo female mullerian duct 
includes surrounding region cDNA, RIKEN full-length 
enriched library, clone : 6820415 J05 product :sema domain, 
transmembrane domain (TM) , and cytoplasmic domain, ( 



***semaphorin*** ) 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 
Eukaryot a ; Metazoa ; 



6A* 



full insert sequence. 



Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

cdna library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences Center and Genome science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . r i ken . j p/ 
URL : http : //f antom . gsc . r i ken . j p/ - 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI) : 
JOURNAL (SO): 
OTHER SOURCE (OS) 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 

JOURNAL (SO) ! 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO) 



Carninci ,P. ; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol . , 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci, P.; shibata,Y.; Hayatsu,N.; Sugahara,Y.; 
shibata,K.; ltoh,M.; Konno,H.; okazaki,Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cdnas to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka.s.; Sasaki, N.; 
Carninci, P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai,T.; Tashiro,H.; Itoh,M.; sumi,N.; lshii,Y,; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A. ; 
YamamotOjR. ; Matsumoto,H. ; Sakaguchi ,s. ; Ikegami ,T. ; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K.; Togawa,Y.; 
Izawa.M.; Ohara,E.; Watahiki,M.; Yoneda,Y. ; 
Ishikawa,T.; Ozawa,K.; Tanaka,T.; Matsuura,s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; lnoue,Y.; Kira.A.; 
Hayashizaki , Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 



OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO): 



CA 134:203311 
5 

The FANTOM consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3509) 

Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa,T.; Bono,H.; 
Carninci,p.; Fukuda,S.; Furuno,M.; Hanagaki.T.; 
Hara,A.; Hashizume ,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane.T.; Hori,F.; 
lmotani,K.; lshii,Y.; Itoh.M.; Kagawa,l.; Kasukawa.T.; 
Katoh,H.; Kawai,j.; Kojima.Y.; Kondo,s.; Konno,H.; 
Kouda,M.; Koya.s.; Kunhara,C. ; Matsuyama,T. ; 
Miyazaki,A.; Nlurata,M.; Nakamura,M. ; Nishi,K,; 
Nomura,K.; Numazaki,R.; ohno,M.; ohsato,N.; Okazaki.Y.; 
Saito.R.; Saitoh, H.; sakai,C; sakai,K.; Sakazume,N.; 
Sano,H.; Sasaki ,D. ; shibata,K.; Shinagawa,A. ; 
Shiraki,T.; Sogabe,Y.; Tagami,M.; Tagawa,A. ; 
Takahashi ,F. ; Takaku-Akahi ra,S. ; Takeda,Y.; Tanaka,T.; 
Tomaru,A.; Toya,T. ; Yasunishi , A. ; Muramatsu,M. ; 
Hayashizaki ,Y. 
Direct submission 

submitted (16-APR-2002) Yoshihide Hayashizaki, The 
Institute of Physical and Chemical Research (RIKEN) , 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic sciences center (GSC) , RIKEN Yokohama 
institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc. riken.jp i URL: http://genome.gsc. riken.jp/, 
Tel : 81-45-503-9222 , Fax : 81-45-503-9216) 



FEATURES (FEAT): 

Feature Key Location 

source 1..3509 



Qualifier 

/orqanism="Mus musculus" 

/moT-type= M mRNA" 

/strain="c57BL/6J" 

/ db-xref=" FANTOM-DB : 6820415 J05 H 

/db-xref="taxon : 10090" 

/clone= ,, 6820415J05" 

/sex="female" 

/tissue-type="mullerian duct 
includes surrounding region" 
/clone-lib= , 'RlKEN full-length 
enriched mouse cDNA library" 
/dev-stage="12 days embryo" 
/note="sema domain, transmembrane 
domain (TM) , and cytoplasmic 
domain, (semaphorin) 6A 
(MGD | MGI: 1203727, GB | NM-018744 , 
evidence: BLASTN, 100%, 
match=1802)" 



misc-feature 



1..3509 



SEQUENCE (SEQ): 
1 gaaaacacat 
61 gtagacacaa 
121 agatctggaa 
181 tcaggtgcag 
241 aaaaggatcc 
301 cggataatag 
361 aacaaaataa 
421 gaagcctaca 
481 atcagaacaa 
541 caagtaacat 
601 aaagccagaa 
661 aggctactat 
721 gacaaaaaca 
781 gaaaagaagc 
841 atacctctag 
901 tagaagacac 
961 agcaacaaaa 
1021 gggtcacagc 
1081 gtttgctttc 



ccaaacaggt 
taaagaaaac 
ccatagatgc 
aagattccat 
taactcaaaa 
gaattgatga 
tagaagaaaa 
gaactccaaa 
caaatgcact 
ataaaggaag 
gagcctggac 
acccggccaa 
aattcacaca 
aatacaagga 
aatagagggt 
agatatttat 
taattttatg 
attaggaggg 
cgttgttgtg 



gatggaaatg 
ccaaagcgag 
gagcatcagc 
agagaacatc 
catccaggta 
gaatgaagat 
cttcccaaac 
tagactggac 
aaataaagat 
gcctatcaga 
agatgttata 
actctcaatt 
atatctttcc 
cggaagaata 
ctcaaactgt 
attaagattc 
tttgtggggg 
ttaagaacca 
ataaacacca 



aacaaaacca 
gcaacgctgg 
aacagaatac 
gacacaacag 
atgcaggaca 
tttcaactta 
ataaaaaaag 
cagaaaagaa 
agaatattaa 
attacaccag 
cagacactaa 
accatagatg 
acgaatccag 
tttagcataa 
aggtcactat 
ataccagtag 
ggtcacaata 
ctgctctcaa 
agacaaaaaa 



tactagaact 
agatagaaac 
aagaaatgga 
tcaaagaaaa 
caatgagaag 
aagggccagc 
agatgcccat 
attcctcccg 
aagcagtaag 
acttttcacc 
gagaacacaa 
gagaaaccaa 
cccttcaaag 
tccccttctt 
ccaaaaatag 
caaagttata 
acataaagaa 
acctctaaca 
gcaacttacg 



aaaaggggaa 
cctaggaaag 
agagagaatc 
tacaaaatgc 
accaaaccta 
taatatcttc 
gatcatacaa 
acacataata 
ggagaaaggt 
agagactatg 
atgccagccc 
agtattccac 
gataataaca 
tgtagaacat 
gaccatccat 
attatgaagt 
ctatattaaa 
agatacccta 
gagaatttat 



1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 



tacagtttat 
tgcttcctgg 
acaccccagg 
tcaattaacc 
tttgtgtcaa 
tccagttcca 
tggagatgca 
aggaaatctc 
gcctgtggta 
gccagaaggc 
tgtgctggat 
ctgaacttca 
ccttggttcc 
tttccttgac 
caaacatctt 
atgccaccaa 
gttctacaca 
attaatccat 
ctgtttgggg 
catgccaaca 
aagcagtggc 
attttcaaag 
tttatgccct 
gctctattac 
taggtatcca 
ccttacttat 
gtagtactgc 
aaagttttct 
tgtccactat 
ctgctctctc 
agtggtttca 
ttccttctcc 
gaccagacca 
ctagaaagcc 
aaagagattt 
attgcagtag 
gaaaagggaa 
agccttttcc 
ctttcttggt 
agacaagttg 



catcaggaga 
cttccctctt 
ctcgctttcc 
attaagaaca 
gttgacagat 
tctctgttca 
actgagctgt 
agcatgttgt 
tagtctcctt 
atgaacaact 
catcctcttt 
ttaagacgca 
tgagaacaat 
tttatcttgt 
ctgctctcct 
ctccccattg 
tgccagcaat 
tctgcgagtt 
tttacacatg 
tgaaggggtt 
caactgcgga 
aagatgatgt 
ggccttactg 
ctgtgggaag 
ccaaaggtga 
ttaccactct 
tctgagccct 
aaacttgttg 
tttccttttc 
cattgaaacc 
tatggttatg 
cgttggagaa 
gcttgagaac 
cgtgccacac 
ttttgttttt 
acaacgctgc 
tcatcctgaa 
tggaggagat 
aactatttgt 
tgtgatcaca 



agctaaggtg 
ctaggtcgct 
aggaggtggt 
tggctcacag 
agtaacgcat 
ctggggtttg 
tctggctaag 
acttggtggc 
catgccagtt 
ctatcccctc 
agaaaaatat 
tccactcatg 
ggtcaggtga 
gttcacggat 
gggcatggat 
ccacattctc 
tagaagatag 
acaagggagt 
actacattgc 
acgctttggt 
gatctctgac 
tttttttcta 
aacatatcat 
gctggtgttc 
aacagacata 
ggccccttca 
ccacttttag 
acagaggttt 
aacgttaaaa 
atcagtcttt 
gatatttatt 
gttggagtcg 
attacttaca 
ttgtttgaag 
caaatgtgat 
cgggccatat 
gttcttggcc 
gaatgtttac 
tattctactg 
ccgtttcct 



ggaacttgac 
tcctctagct 
accgcccacg 
acaagcctgt 
ccctgagagg 
tttgcgcgtc 
gctctctcat 
attctgcatt 
acttagagaa 
caccccttgt 
gcaacctcca 
gacgaggcag 
gagtgtcaaa 
cctgtcattg 
tcagctagga 
agtgagaaga 
aaaaaaaaat 
cctgaagcag 
tggcatgaag 
atttttttac 
agcattaaac 
aacagtggca 
gaactctctt 
tctagatcct 
aagttccata 
cagatatggt 
ggccctcttt 
tatgcctcct 
agaaatgcta 
ccttattgct 
tattatagaa 
tcatactaat 
tgcatatata 
tggaaagaga 
cctctcattt 
cagaatcaca 
aggataggaa 
aacccagaaa 
cgacctctcc 



ccagaaacca 
tgctcagcca 
gtgggcaggg 
gtgttagagg 
cgtgtcctca 
tggaaggatg 
ggaatgagga 
gttcaacatc 
gcagttaatt 
tttcttttag 
atgagtttcc 
tgccttccat 
caaactgcat 
gatggtgctt 
aatgttgaca 
gtgccagtat 
atttccttcc 
gacccgtcag 
gggcacagca 
ctgaagtgat 
atcctttgtg 
gttgataaca 
gactataacc 
gtcttggtac 
gaaagaggat 
ttggcttgca 
acccctttgt 
aggaaaagag 
agtatattcc 
catggaaggg 
cttggcctct 
gtctgcatat 
tttttttcat 
ccctttaatg 
ttagataccg 
ctgtggtttt 
gcagtggttt 
agtgagtgga 
cattcatgga 



caaggcacac 
cctctcttgt 
ctttcttaca 
tatgtctagc 
cacactcatt 
ccaagttgcc 
ggcagtccag 
ctcttcttgt 
caaagagttt 
gccaggctgt 
cgatgatacc 
catcaacaga 
ccaaagtgga 
tgaaatgcca 
ggctgggtag 
tttcaggcta 
agattagctc 
aagtacttac 
tttgtagtgg 
gccaacgtag 
attgcaggtc 
caaaatgatt 
tagaaaccat 
ttagagcaaa 
gtccactagg 
tgccctgatt 
gtacaacctg 
tgaaggatgg 
cttcctacat 
atcattgctg 
gtacacatac 
gaaaataagg 
gacattgtta 
atagacttac 
cctgaccaaa 
cctgggatca 
cctaaatggc 
acgttgggtc 
aaatcttggc 
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LOCUS (LOC): 
GenBank acc. no. (gbn) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



KEYWORDS (ST) : 
SOURCE: 
ORGANISM (ORGN): 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AK031307 GenBank (R) 
AK031307 

AK031307.1 Gl: 26082260 

486370-53-0 

4374 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 13 days embryo male testis cdna, riken 
full-length enriched library, clone :6030405B10 
product: weakly similar to ***SEMAPHORlN*** 

***6A*** PRECURSOR ( ***SEMAPHORIN*** *** V IA*** 
) (SEMA VIA) (SEMAPHORIN Q) (SEMA Q) [MUS musculus], 
full insert sequence. 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 



sciurognathi ; Muridae; Murinae; Mus 
COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 

URL : http : //genome . gsc . ri ken . j p/ 

URL : http : //f antom . gsc . ri ken . j P/ • 
REFERENCE: 1 

AUTHOR (AU) : Carni nci , p . ; Hayashi zaki , Y . 

TITLE (Ti) : High-efficiency full-length cDNA cloning 

JOURNAL (so): Meth. Enzymol., 303, 19-44 (1999) 



OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 



JOURNAL (SO): 
OTHER SOURCE (OS) ; 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI); 



JOURNAL (SO): 

reference: 

AUTHOR (AU) : 

TITLE (TI) : 

JOURNAL (SO): 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO): 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 



CA 131:318304 
2 

Carninci.P.; shibata,Y.; Hayatsu.N.; sugahara.Y. ; 
Shibata,K.; itoh,M.; Konno.H.; Okazaki,Y.; 
Muramatsu,M. ; Hay as hi zaki , Y . 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka,S.; Sasaki, N.; 
carninci,P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai,T.; Tashiro,H.; ltoh,M.; sumi,N.; lshii,Y.; 
Nakamura,S.; Hazama,M.; Nishine,T.; Harada,A.; 
Yamamoto,R. ; Matsumoto,H. ; Sakaguchi ,s. ; ikegami ,T. ; 
Kashiwagi ,K. ; Fujiwake,S.; lnoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
Ishikawa,T.; Ozawa,K. ; Tanaka,T.; Matsuura,s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; lnoue,Y.; Kira,A.; 
Hay as hi zaki , Y. 

riken integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 4374) 

Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa f T.; Bono,H.; 
Carninci,P.; Fukuda,S.; Furuno,M.; Hanagaki,T.; 
Hara,A. ; Hashizume ,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
lmotani,K.; lshii,Y.; Itoh.M.; Kagawa,I.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Koqima.Y.; Kondo,S.; Konno,H.; 
Kouda,M.; Koya,S.; Kunhara,C. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K. ; Numazaki,R.; ohno,M.; ohsato,N.; okazaki,Y.; 
saito,R.; saitoh,H.; sakai,c; sakai,K.; sakazume,N.; 
sano,H.; Sasaki, D.; Shibata,K.; Shinagawa,A. 
shiraki,T.; Sogabe,Y.; Tagami,M. 
Takahas hi , F. ; Takaku-Akahn ra , S . ; 
Tomaru,A.; Toya,T.; Yasunishi,A. 
Hayashizaki , Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
Institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . j j p/ , 
Tel : 81-45-503-9222 , Fax : 81-45-503-9216) 



Tagawa,A. ; 
Takeda,Y. ; Tanaka,T. 
; Muramatsu,M. ; 



FEATURES (FEAT): 
Feature Key 



source 



Location 
.4374 



Qual ifier 



/organism="Mus musculus" 

/mol-type= ,, mRNA M 

/strain= ,, c57BL/6J" 

/db-xref =" FANTOM-DB : 6030405B10" 

/db-xref="taxon: 10090" 

/clone="6030405Bl0 M 

/sex= ,, male" 

/tissue-type="testis" 



/clone-lib="RlKEN full-length 
enriched mouse cDNA library" 
/dev-stage="13 days embryo" 
misc-feature 1. .4374 /note="weakly similar to 

SEMAPHORIN 6A PRECURSOR 
(SEMAPHORIN VIA) (SEMA VIA) 
(SEMAPHORIN Q) (SEMA Q) [MUS 
musculus] (SWISSPROT 1 035464, 
evidence: FASTY, 60.6%ID, 
64.8%length, match=1719)" 

SEQUENCE (SEQ): 

1 ggtttcttct tggcagggcc ggctcggcgc cgctccgcaa gccgtcaggg tgagcccgcc 
61 ttcctcgctg cggggcttgg agcctccagc ccgcccgcag cgcagcccgg gccgccgggc 
121 cagcttcggc cgggagaacg cagggacagc ctgggagtgc ggagccactg actgtccacg 
181 cggggaccgt gagcacccac tcgcgggtct cccagccact ctcccccgca actttgcgtg 
241 ctttgttctc tggctgaaaa aatgctccac agaagcgcca tagaccgggt ctcctccagg 
301 cggcaagggc tcggccggga gacgtgttac gctctcagct ccccatcaat tatggatgga 
361 aacaaataag gaaaagtcaa ttttgtatga gccgcgtctt tcgcagccag aggcgtcttg 
421 tagcctggag cagaggcagg caaaccgagc ccaacaacca ggtagctaag tgggacttct 
481 gaggaggggg agctcagatt tcttcttggc caaccatggg gttccttctg ctttggttct 
541 gcgtgctgtt ccttctggtc tccaggttac gggcggtcag cttcccagaa gacgatgagc 
601 ccctcaacac ggttgactat cactattcaa ggcaatatcc ggtttttaga ggacgccctt 
661 caggcaacga atcgcagcac aggctggact ttcagctgat gttgaaaatt cgagacacac 
721 tttatattgc tggcagggat caagtctata cagtgaactt aaatgaaatc ccccaaacag 
781 aggtgatacc aagcaagaag ctgacgtgga ggtccagaca gcaggatcga gaaaattgtg 
841 ctatgaaagg caagcataaa gatgaatgcc acaacttcat caaagtcttt gtcccaagaa 
901 atgatgagat ggtttttgtc tgtggtacca atgctttcaa cccgatgtgc agatactata 
961 ggttgagaac gttagagtat gatggggaag aaattagtgg cctggcacga tgcccgtttg 
1021 atgcccgaca aaccaatgtc gccctctttg ctgatggaaa actctattct gccacagtgg 
1081 ctgatttcct ggccagtgat gctgtcattt acagaagcat gggagatgga tctgcccttc 
1141 gcacaataaa atacgattcc aagtggatca aagaaccaca cttccttcat gccatagaat 
1201 atggaaacta tgtctatttc ttcttcagag aaatcgccgt ggaacataat aacttaggca 
1261 aggctgtgta ttcccgcgtg gctcgcattt gtaaaaacga catgggtggc tcacagcggg 
1321 tcctggagaa acactggact tccttcctta aggctcggct gaactgctcc gttcctggag 
1381 attccttttt ctacttcgac gtcctgcagt ctataacaga cataatccaa atcaatggca 
1441 tccccactgt ggttggggtc ttcaccacac agctcaacag cattcctggt tctgcagtct 
1501 gtgcctttag catggacgac attgagaaag tgttcaaagg gcggttcaaa gagcagaaaa 
1561 ccccagactc tgtttggaca gcagttcccg aagacaaagt accaaaacca aggcctggct 
1621 gttgtgccaa acacggccac gcagaagctt acaagacctc catcgacttt ccagatgaca 
1681 ccctggcttt catcaagtcc cacccgctga tggactctgc cgtcccaccc attgccgatg 
1741 agccctggtt cacaaagaca cgggtcaggt acaggttgac agccatcgaa gtggaccgtt 
1801 cagcagggcc ataccaaaac tacacagtca tctttgttgg ctctgaagct ggcgtggtac 
1861 ttaaagtttt ggcaaagacc agtcctttct ctctgaatga cagtgtatta ctcgaagaga 
1921 ttgaagctta taacccagcc aagtgcagcg ccgagagtga ggaggacaga aaggtggtct 
1981 cattacagct ggacaaggat caccatgctt tatacgtggc cttctctagc tgcgtggtcc 
2041 gcatccccct cagccgctgt gagcgctacg gatcgtgtaa aaagtcttgc attgcatcac 
2101 gtgacccgta ctgtggttgg ttaagccagg gagtttgtga gagagtgacc ctagggatgc 
2161 tccctggagg atatgagcag gacacggagt acggcaacac agcccaccta ggggactgcc 
2221 acgaaagttt gcctccttca actacaccag attacaaaat atttggcggt ccaacatctg 
2281 acatggaggt atcctcatct tctgttacca ctgtggcaag tagcccagaa attacatcta 
2341 aagtgattga tacctggaga cctaaactga cgagctcccg gaaatttgta gttcaagatg 
2401 acccaaatac ttctgatttt actgatacta tatcaggtgt acggtgggaa gtccagtctg 
2461 gagaatccaa tcagatggtc cacatgaatg tcctcatcac ctgcgtgttt gccgcttttg 
2521 tcttgggcgc gttcatcgca ggagtggccg tgtactgcta ccgtgacatg ttcgttcgga 
2581 agaacataaa gatccataaa gacgcagaat ccgcccagtc gtgcacagac tccagcggaa 
2641 gcttcgccaa gctgaacggc ctctttgaca gccccgtcaa ggaataccag cagaacattg 
2701 attctcccaa actctacagc aacctgctga ccagtcggaa ggaactgcca ccaaacacgg 
2761 atacaaagtc catggccgtg gaccacagag gccagcctcc cgagctggct gctctcccca 
2821 cgccggaatc cacacctgtc ctccaccaga agaccctgca ggccatgaag agccactctg 
2881 agaaggccca cagccacggt gcttcaagga aagaacaccc ccagtttttt ccttctagtc 
2941 ctccacccca ttccccattg agtcacgggc atatccccag tgccatcgtt cttccaaacg 
3001 ccactcacga ctacaataca tccttctcca actcgaatgc ccacaaagcc gaaaagaagc 
3061 ttcagagcat ggatcaccct cttacgaagt catccagtaa gcgggagcac cggcggtctg 
3121 tggattccag gaatactctc aatgatctcc tgaagcatct aaatgaccca aacagtaacc 
3181 ccaaagccat cctgggagag atccatatgg ctcatcaaac cctcatgctg gacccggtgg 
3241 gaccaatggc tgaggtccca cccaaggtcc ctaaccggga ggcatctcta tactcccctc 
3301 cctccacact ccccagaaat agtccaacca agagagtaga tgtccccacc actcctgggg 
3361 tgccaatgac ttctctggaa agacaaaggg gttatcacaa aaattcctcc cagaggcact 
3421 ctatatctgc cgtgcctaaa aacttaaact caccaaatgg tgttttgtta tctagacagc 
3481 cgagtatgaa ccgtggaggc tatatgccca ccccaacagg ggcgaaggtg gactatattc 
3541 aggggacacc ggtgagtgtt catctgcagc cctccctctc cagacagagc agctatacca 
3601 gtaatggcac cctccccagg acgggactaa agaggacacc atccttaaaa cctgatgtgc 
3661 caccaaagcc ttcctttgtt ccgcaaacca catctgtcag accactgaac aagtacacgt 
3721 actaggcctc aagtatgcta ttcccgtgtg gctttatcct gtccctgctg ttgcgaggaa 
3781 gctgtgaggg taccttcaga tgagatacct gcttgtattt taagagaaac aagtagccaa 



3841 agaaactctg 
3901 tctgtgtact 
3961 agaagggatg 
4021 cagaagactg 
4081 tcccattaaa 
4141 aaaatggaaa 
4201 ctgaaaattc 
4261 tttgttttat 
4321 gaggggcgag 



tcactttggt aacaccagaa 
tgccggaaac gaagggaggt 
tgtagccagg gaaggctccc 
aacaaatact tgaaaatggg 
tgtgaacatt ttaatatgta 
gatgggaatg tctcaaaaca 
aatctttgac ccaaactgta 
tttttttgtt ttgttttgtt 
tgtgtgtgtg tgtgtgtgtg 



cttgccacat gtagctacta 
cctgctcact ccatttcttt 
ttcaccagtg taaagagctg 
ttcaatgtag actgccattc 
tgcattcacc ttgcctcttg 
aaatgagctt ggagattacc 
gcaattttta ttttctgagt 
tttcaatgcc accaacaaac 
tgtgtgtgtg tgtgtgtgag 



cagcaaggct 
cgtttgaagc 
atacagtact 
tgtgtggtct 
cacaaatgtc 
aagcagtttg 
gtggcactgt 
tatgttaaga 
agag 
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locus (LOC) : 
GenBank ACC. NO 
GenBank version 

CAS REGISTRY NO 
SEQUENCE LENGTH 
MOLECULE TYPE (CI) 
DIVISION CODE (CI) 
DATE (DATE): 
DEFINITION (DEF) : 



KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN): 



(GBN) 
(VER) : 
(RN): 
(SQL) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 

CA341055 GenBank (R) 
CA341055 

CA341055.1 GI:24559151 
472461-69-1 
477 

mRNA; linear 
Expressed sequence tag 
4 Nov 2002 

haa83a04.yl Fugu hgmpG adult eye Takifugu rubripes cDNA 
clone 6360486 5' simile 



lar to TR:035464 035464 
***VIA*** PRECURSOR . 



mRNA 



NUCLEIC ACID COUNT (NA) 
COMMENT : 

Contact: Stephen L 



***SEMAPHORIN*** 
sequence. 
EST 

Takifugu rubripes 
Takifugu rubripes 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopteryqii; Teleostei; 
Euteleostei; Neoteleostei ; Acanthomorpna; 
Acanthopterygii ; Percomorpha; Tet raodonti formes ; 
Tetraodontidae ; Takifugu 
: 108 a 118 c 144 g 107 t 



Johnson 

Washington University School of Medicine 
4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 
Tel : 314 286 1800 
Fax: 314 286 1810 
Email : zbrafish@watson.wustl .edu 

Library materials and construction by Greg Elgar (UK MRC HGMP-RC) . 
DNA Sequencing by: Washington university Genome sequencing Center 
Clone distribution: Fugu clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL, send email to: 
i nf o@i mage . Tl nl . gov 
Seq primer: T3 ET from Amersham. 



REFERENCE: 

AUTHOR (AU) 



TITLE (TI): 
JOURNAL (SO): 

FEATURES (FEAT) : 
Feature Key 



1 (bases 1 to 477) 
Clark, M.; Dohnson ,S . L. ; Lehrach,H.; Lee,R.; Li , F. ; 
Marra,M.; Eddy,S.y; Hillier,L.; Kucaba,T. ; Martin, J.; 
Beck,C; Wylie,T.; Underwood,K. ; steptoe,M.; 
Theising,B.; Allen,M.; Bowers, Y.; Person, B.; 
Swaller,T.; Gibbons,M.; Pape,D.; Harvey, N.; schurk,R. 
Ritter.E.; Kohn,s.; shin.T.; Jackson, Y. ; Cardenas, M.; 
McCann,R.; wilson,R. 
Washu zebrafish EST Project 1998 
unpublished (1998) 



Location 



source 1. .477 



Qualifier 

/organi sm="Taki fugu rub ri pes " 
/db-xref="taxon: 31033" 
/clone= n 6360486 n 

/clone-lib= M Fugu hgmpG adult eye" 
/dev-stage="adult" 
/l ab-host="DHl0B (Tl- resi stant) " 
/note="Organ: eye; Vector: 
pBluescript II KS (Stratagene) ; 
Site-1: Xhol; Site-2: EcoRI; 
Library is oligo-dT primed and 
directionally cloned using cDNA 
prepared with the Stratagene cDNA 
synthesis kit. Tissue source: 
adult eye. Library materials and 
construction by Greg Elgar (UK 

MRC HGMP-RC) . " 



SEQUENCE (SEQ) : 

1 ccggtcctgg 
61 cccgttgtct 
121 atttcagcgc 
181 tgtgccgaag 
241 cagagttgta 
301 gtagcagagg 
361 gtcagaggcc 
421 ggacagtgaa 



gacgctcgtc tgcttcagtt 
tccctccgca gagtggattt 
ggcatgtaac aaaaagaagc 
cgggcgccgt ctgaactgcc 
gaaccactcc gagacgcatc 
accagcggca gcggcggcgg 
ttgttactct acttcacgct 
cctattagca tctcgcacag 



tcaccgaggg gagtttcgaa 
ccgatggata ttttgctgtg 
gcaagagaaa agtagcgctc 
gtgagcgccg tctcatggat 
cattcatgag ctgaaggctg 
cggctgtggc agtgttgttg 
actgcagata gctggggcgg 
caactacaca aagcagtatc 



atcaaaccag 
aagaggtgtg 
gtattgttgt 
tactaaaggg 
agaggattct 
cgaccatgag 
gcttccccga 
cggcgtt 
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LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



BC032619 GenBank (R) 
BC032619 

BC032619.1 Gl: 22749800 

453031-06-6 

6875 

inRNA; linear 

High-Throughput CDNA Sequencing 
4 Mar 2003 

Homo sapiens, similar to sema domain, transmembrane 
domain (TM) , and cytoplasmic domain, ( 



) 



*6A* 



clone IMAGE: 5578066, 



KEYWORDS (ST) : 
SOURCE: 
ORGANISM (ORGN) 



***semaphorin*** 
mRNA. 
HTC 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 1784 a 1795 c 1592 g 1704 t 
COMMENT : 

contact: mgc help desk 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC/DCTD/DTP 

CDNA Library Preparation: Life Technologies, Inc. 

CDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA sequencing by: National institutes of Health intramural 

sequencing Center (nisc) , 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact: nisc_mgc@nhgri .nih.gov 

Akhter,N., Ayele,K., Beckstrom-Sternberg,s .M. , Benjamin, B., 
Blakesley,R.w. , Bouffard ,G.G. , Breen,K., Brinkley,C. f Brooks, s., 
Dietrich, N.L. , Granite, S., Guan,x., Gupta, J., Haghighi,P., 

Ho,s.-L., Karlins,E., Kwong,P., Laric,P. f Leqaspi,R., 
, Masiello.C, Maskeri,B., Mastrian.S.D. .Mccloskey, J .c. , 
, Pearson, R., stantripop,s. , Thomas, P. J., Touchman , 3 .w. , 
, Vogt,J.L., Walker,M.A., Wetherby ,K.D. , wiggins,L., 
Young, A., Zhang,L.-H. and Green,E.D. 

clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
series: IRAK Plate: 69 Row: o Column: 4 

This clone was selected for full length sequencing because it 

following selection criteria: matched mRNA gi : 11991659 
has the following problem: retained intron. 
1 (bases 1 to 6875) 
Strausberq,R. 
Direct Submission 

Submitted (06-JUN-2002) National Institutes of Health, 
Mammalian Gene collection (MGC), Cancer Genomics 
Office, National Cancer Institute, 31 center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



Hansen, N. _ 
Maduro,Q.L. 
McDowel 1,3. 
Tsurgeon,c. 



passed the 
This clone 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) 



FEATURES (FEAT) 
Feature Key 



Location 



source 1..6875 



Qual ifier 

/organism="Homo sapiens" 

/db-xref="taxon:9606" 

/cl one=" IMAGE : 5578066" 

/ti ssue-type="ski n , mel anoti c 

melanoma. 

/cl one-1 i b="NIH-MGC-72" 
/lab-host="DHlOB" 
/note="Vector : pCMV-SPORT6 n 



SEQUENCE (SEQ) : 

1 ccacgcgtcc gggagacgcc agccaaagta ggcggagttg cggaaggaga gtccgatctg 
61 cgcaatccag tccgggactt ggctccgccc gggagcggcg gcctctccgg tgacagagga 
121 gtagtgagcg gcacggggtg aggctgcagc caactccgct ccccgcgcac tcggctgccc 
181 aggcgctcgg aacccagcag cggcgctcct ccgcggtgcc ggtcgcccgc gatgcccgct 
241 tagcagcgtg tagcagcggc cagcatcacc acacccgcgg caccgcgctg ccggccgcag 
301 agccgggcca gagccttgca cccctgcctc agcccccacc ccgccccccg ccctgaattg 
361 acttgttaat cggcgcagac accaccaagg ggactcaccg gagtggaatc caagtggaat 
421 ttggatttgg agaagagttt cttgaacatt taccctcttc cttgttggtt ttctttttct 
481 ttttcttctt tttttttttg gcttcttttt tcctctcccc ttctccgctc gtcattggag 
541 atgaacacat cgcgtttgca tcccagaaag tagtcgccgc gactatttcc cccaaagaga 
601 caagcacaca tgtaggaatg acaaaggctt gcgaaggaga gagcgcagcc cgcggcccgg 
661 agagatcccc tcgataatgg attactaaat gggatacacg ctgtaccagt tcgctccgag 
721 ccccggccgc ctgtccgtcg atgcaccgaa aagggtgaag tagagaaata aagtctcccc 
781 gctgaactac tatgaggtca gaagccttgc tgctatattt cacactgcta cactttgctg 
841 gggctggttt cccagaagat tctgagccaa tcagtatttc gcatggcaac tatacaaaac 
901 agtatccggt gtttgtgggc cacaagccag gacggaacac cacacagagg cacaggctgg 
961 acatccagat gattatgatc atgaacggaa ccctctacat tgctgctagg gaccatattt 
1021 atactgttga tatagacaca tcacacacgg aagaaattta ttgtagcaaa aaactgacat 
1081 ggaaatctag acaggccgat gtagacacat gcagaatgaa gggaaaacat aaggatgagt 
1141 gccacaactt tattaaagtt cttctaaaga aaaacgatga tgcattgttt gtctgtggaa 
1201 ctaatgcctt caacccttcc tgcagaaact ataagatgga tacattggaa ccattcgggg 
1261 atgaattcag cggaatggcc agatgcccat atgatgccaa acatgccaac gttgcactgt 
1321 ttgcagatgg aaaactatac tcagccacag tgactgactt ccttgccatt gacgcagtca 
1381 tttaccggag tcttggagaa agccctaccc tgcggaccgt caagcacgat tcaaaatggt 
1441 tgaaagaacc atactttgtt caagccgtgg attacggaga ttatatctac ttcttcttca 
1501 gggaaatagc agtggagtat aacaccatgg gaaaggtagt tttcccaaga gtggctcagg 
1561 tttgtaagaa tgatatggga ggatctcaaa gagtcctgga gaaacagtgg acgtcgttcc 
1621 tgaaggcgcg cttgaactgc tcagttcctg gagactctca tttttatttc aacattctcc 
1681 aggcagttac agatgtgatt cgtatcaacg ggcgtgatgt tgtcctggca acgttttcta 
1741 caccttataa cagcatccct gggtctgcag tctgtgccta tgacatgctt gacattgcca 
1801 gtgtttttac tgggagattc aaggaacaga agtctcctga ttccacctgg acaccagttc 
1861 ctgatgaacg agttcctaag cccaggccag gttgctgtgc tggctcatcc tccttagaaa 
1921 gatatgcaac ctccaatgag ttccctgatg ataccctgaa cttcatcaag acgcacccgc 
1981 tcatggatga ggcagtgccc tccatcttca acaggccatg gttcctgaga acaatggtca 
2041 gataccgcct taccaaaatt gcagtggaca cagctgctgg gccatatcag aatcacactg 
2101 tggtttttct gggatcagag aagggaatca tcttgaagtt tttggccaga ataggaaata 
2161 gtggttttct aaatgacagc cttttcctgg aggagatgag tgtttacaac tctgaaaaat 
2221 gcagctatga tggagtcgaa gacaaaagga tcatgggcat gcagctggac agagcaagca 
2281 gctctctgta tgttgcgttc tctacctgtg tgataaaggt tccccttggc cggtgtgaac 
2341 gacatgggaa gtgtaaaaaa acctgtattg cctccagaga cccatattgt ggatggataa 
2401 aggaaggtgg tgcctgcagc catttatcac ccaacagcag actgactttt gagcaggaca 
2461 tagagcgtgg caatacagat ggtctggggg actgtcacaa ttcctttgtg gcactgaatg 
2521 ggcattccag ttccctcttg cccagcacaa ccacatcaga ttcgacggct caagaggggt 
2581 atgagtctag gggaggaatg ctggactgga agcatctgct tgactcacct gacagcacag 
2641 accctttggg ggcagtgtct tcccataatc accaagacaa gaagggagtg attcgggaaa 
2701 gttacctcaa aggccacgac cagctggttc ccgtcaccct cttggccatt gcagtcatcc 
2761 tggctttcgt catgggggcc gtcttctcgg gcatcaccgt ctactgcgtc tgtgatcatc 
2821 ggcgcaaaga cgtggctgtg gtgcagcgca aggagaagga gctcacccac tcgcgccggg 
2881 gctccatgag cagcgtcacc aagctcagcg gcctctttgg ggacactcaa tccaaagacc 
2941 caaagccgga ggccatcctc acgccactca tgcacaacgg caagctcgcc actcccggca 
3001 acacggccaa gatgctcatt aaagcagacc agcaccacct ggacctgacg gccctcccca 
3061 ccccagagtc aaccccaacg ctgcagcaga agcggaagcc cagccgcggc agccgcgagt 
3121 gggagaggaa ccagaacctc atcaatgcct gcacaaagga catgcccccc atgggctccc 
3181 ctgtgattcc cacggacctg cccctgcggg cctcccccag ccacatcccc agcgtggtgg 
3241 tcctgcccat cacgcagcag ggctaccagc atgagtacgt ggaccagccc aaaatgagcg 
3301 aggtggccca gatggcgctg gaggaccagg ccgccacact ggagtataag accatcaagg 
3361 aacatctcag cagcaagagt cccaaccatg gggtgaacct tgtggagaac ctggacagcc 
3421 tgccccccaa agttccacag cgggaggcct ccctgggtcc cccgggagcc tccctgtctc 
3481 agaccggtct aagcaagcgg ctggaaatgc accactcctc ttcctacggg gttgactata 
3541 agaggagcta ccccacgaac tcgctcacga gaagccacca ggccaccact ctcaaaagaa 
3601 acaacactaa ctcctccaat tcctctcacc tctccagaaa ccagagcttt ggcaggggag 
3661 acaacccgcc gcccgccccg cagagggtgg actccatccc ggtgcacagc tcccagccat 
3721 ctggccaggc cgtgactgtc tcgaggcagc ccagcctcaa cgcctacaac tcactgacaa 
3781 ggtcggggct gtagcgtacg ccctcgctaa agccggacgt accccccaaa ccatcctttg 
3841 ctcccctttc cacatccatg aagcccaatg atgcgtgtac ataatcccag ggggaggggg 
3901 tcaggtgtcg aaccagcagg caaggcgagg tgcccgctca gctcagcaag gttctcaact 
3961 gcctcgagta cccaccagac caagaaggcc tgcggcagag ccgaggacgc tgggtcctcc 
4021 tctctgggac acaggggtac tcacgaaaac tgggccgcgt ggtttggtga aggtttgcaa 
4081 cggcggggac tcaccttcat tctcttcctt cactttcccc cacaccctac aacaggtcgg 
4141 acccacaaaa gacttcagtt atcatcacaa acatgagcca aaagcacata cctaccccat 
4201 cccccacccc cacacacaca cacacatgca cacaacacat acacacacac gcacagaggt 
4261 gaacagaaac tgaaacattt tgtccacaac ttcacgggac gtggccagac tgggtttgcg 
4321 ttccaacctg caaaacacaa atacattttt taaaatcaag aaaatttaaa aagacaaaaa 
4381 aaaaagaatt cattgataat tctaactcag actttaacaa tggcagaagt ttactatgcg 



4441 caaatactgt gaaatgcccg ccagtgttac agctttctgt tgcagcagat aaatgccatg 
4501 ttgggcaact atgtcataga tttctgctcc tcctctcttt taatgaaata acgtgaccgt 
4561 taacgcaagt aactctttat ttattgttca cccttttttt ccttaaggaa aggactcttc 
4621 caaatatcat cctatgaaca gctcttcaga aagcccattg aaagttaaac tatttaacgt 
4681 gaaatccatt aactggaata attgagtttc tttattttta caataaattc actgagtaaa 
4741 taagttggag ctggaattct gagctttgtg tttggactgt ctggtctgta gctaaaggaa 
4801 aaagttagga ggggaatatt tatttaaatc tatcagttca ttgcctggtc aggtctctgg 
4861 cctataaaca tgcaaaaaaa ctaattcatg ttaaagtcct tccagatcct aatttctaaa 
4921 gcaatcagga gagaaagctt ttagaatggc acatcatatc tcattgatgc caacatggtt 
4981 ttgtccatgg ttctgacttt ctgtgaaggc accagcttgc aatatgccat cccatttcac 
5041 cttgcatgtg agacagcaaa caaaatccac aaatggtgtg aactaatatg ctggctgcta 
5101 ccttgcataa attaatgatt tgatcacacg ggttcttcgt ggggttacat ctgtgaatag 
5161 cctgttttcc acatgtaaat ttgtgcctta caccttgagt tgtgtacact tgtaaactct 
5221 ttatgatcaa ctgttccccc ttttgaaata agtgcagata tttatttaac cctcccttcc 
5281 ccaccctctg ccccacttcc agccctctga aagattggag tcaagcagat ggaagaatgc 
5341 agtggtgata gttgtcatgc gacagcctga gaacgctggg cagcaccaca ccctccaatt 
5401 cacactgcct tctagttgtg ccaactggaa ccaccctttg gctgtgctgc gaagcatgga 
5461 ccccagtgtt gttgtgggtg tgtcaaatcc cctttcatcc tcaagagctc cctgcttccc 
5521 ttagattatt tcaatacggt gatatcctta tttgctagca gaaaagggac taacgtccca 
5581 ttcctctttt ctgctgcgtc cactggctag agagcaggcg gtgcgcggtt gggcagacac 
5641 ctgggaggag tctccaagcc atgtgcacag cacacacgtg cagtgcacac aaagaaatga 
5701 catggaaata gatgcaggca ggctggtccc tgctgtgatt aacgagtaac tccaagtaca 
5761 aggccgacca caatggatgc tgcaaaaacg ttgactgggg caaaggattt tttattttat 
5821 tttttatttt gttaattatg tttttagggg atgggaggtg gtgtgtgttt ttctcctctt 
5881 ggttttcatt tgctcaagca cacaaaaggg acttttgttt actctatcat gaacaaagga 
5941 actgtccaca tactgtaaac catgagcagt gttgttgttg tttttttaaa cagtatattt 
6001 ggtggtctct ttgtctgctt ttatttccag ttcgatcttc tgggtttagt tttgtcttta 
6061 aaaagaattc aaaaaaagac tgacaatgac agttttgagt tggatagtga aaaagtggag 
6121 cctccataat cagtgtggtt gccttcagac ctgagtactt agctgagggt gggtgagagc 
6181 cctttgttcc aaaagtccat tagttttgct gttgtttagg agtaggtggt tgttgttgtt 
6241 tttacttttt ttttttttgg cttttgtaat ggaatccatg ttcacatcct gtgaactctg 
6301 tctcctgaaa ccctgaagta tttcctagaa cctgaaatat tgttttcttc cttgaatttt 
6361 ctctagaaat gcagaaatta ggaaggtgat gggtctgtat ccccctgccc ccttctttcc 
6421 catgcttgac tcccgagagt actggcagtg cagcacccat cattgacgta agcggctgtc 
6481 ttgtccagtg tttgtaggca tcactgcctc agctgttaag gagacctgtg tcaaaactta 
6541 catccacatt cctacacccc cacaacccat cacttctggt gttaacccta aaatacccac 
6601 atgtattgag ctggtcttct gcatttaagt atttctcccc agtttttttc cccactgtgt 
6661 gtggggggag ggtccataaa cccgagtgtg cctttgcttt ccacccttgc tagacactgg 
6721 tagatgcaac aaactcagat ttatatttgt tgtaaagttg taaaaatatt gtgatgtcac 
6781 caattttcct tccatctcca catcccctaa catctgattc acgacttaat gtatgttgta 
6841 agaaaagaaa aaagaaaaga aaaaaaaaaa aaaaa 
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LOCUS (LOC) : BM182435 GenBank (R) 

GenBank ACC. NO. (GBN) : BM182435 

GenBank VERSION (VER) : BM182435.1 GI: 17513393 

CAS REGISTRY NO. (RN) : 376921-60-7 

SEQUENCE LENGTH (SQL) : 717 

MOLECULE TYPE (ci) : mRNA; linear 

DIVISION CODE (CI) : Expressed sequence tag 

DATE (DATE): 11 Dec 2001 

definition (DEF) : fv61gll.xl sugano SJD adult male Danio rerio cDNA clone 

5413076 3' similar to TR:035464 035464 

* **SEMAPHORIN*** *** V IA*** PRECURSOR . ; contai ns 

element LTR3 repetitive element mRNA sequence. 
SOURCE: zebrafish. 
ORGANISM (ORGN): Danio rerio 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopterygii; Teleostei; 
ostariophysi ; Cypriniformes ; Cyprinidae; Danio 
NUCLEIC ACID COUNT (na) : 153 a 141 c 195 g 228 t 
COMMENT: 

Contact: Stephen L. Johnson 

Washington university school of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel : 314 286 1800 

Fax: 314 286 1810 

Email : zbrafish@watson .wustl .edu 

Library constructed by Dr. sumio Sugano and Dr. Koichi Kawakami DNA 
sequencing by: Washington University Genome Sequencing Center Clone 
distribution information can be found through the I.M.A.G.E. 
Consortium/LLNL, send email to: info@image.Tlnl.gov 
Seq primer: T7 from Gibco 
High quality sequence stop: 463. 
REFERENCE: 1 (bases 1 to 717) 



AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO) 



Clark, M.; Johnson, s.L. ; Lehrach,H.; Lee,R.; Li , F . ; 
Marra,M.; Eddy,S.y; Hillier,L.; Kucaba,T.; Martin, J.; 
Beck,c.; Wylie,T.; Underwood, K. ; Steptoe.M.; 
Theising,B.; Allen, M.; Bowers, Y.; Person, B.; 
Swaller,T.; Gibbons, M.; Pape,D.; Harvey, N.; schurk,R. 
Ritter,E.; Kohn,S.; Shin,T.; Jackson, Y. ; Cardenas, M.; 
McCann,R. ; Wilson ,R. 
WashU Zebrafish EST Project 1998 
unpublished (1998) 



FEATURES (FEAT) 
Feature Key 



Location 



Qual ifier 



source 1..717 /organism= n Danio rerio" 

/db-xref="taxon:7955" 
/clone="5413076" 

/clone-lib="Sugano SJD adult male" 
/sex="male" 

/tissue-type= M whole body" 
/dev-stage="adul t " 
/lab-host="DHlOB (phage 
resistant)" 

/note="vector : pMEl8s-FL3; site-1: 
Drain (CACCATGTG); site-2: Drain 
(CACTGTGTG) ; 1st strand cDNA was 
primed with an oligo(dT) primer 
[ATGTGGCCTTTTTTTTTTTTTTTTT] ; 
double-stranded cDNA was li gated 
to a Drain adaptor 
[TGTTGGCCTACTGG] , digested and 
cloned into distinct Drain sites 
of the pMEl8S-FL3 vector (5' site 
CACTGTGTG, 3' site CACCATGTG). 
Xhol should be used to isolate the 
cDNA insert. Size selection was 
performed to exclude fragments 
<1.5kb. Library constructed and 
donated by Dr. sumio sugano 
(University of Tokyo Institute of 
Medical science). Custom primers 
for sequencing: 5' end primer 
CTTCTGCTCTAAAAGCTGCG and 3' end 
primer CGACCTGCAGCTCGAGCACA . " 

SEQUENCE (SEQ) : 

1 tttttttttt tttttttttc aatgaaagaa accagattta ttgacaataa ccaccctcca 
61 ccactgcata atcattcagt gtgcaccata ataacatccc atctcttata tttacactat 
121 atacacacta tatttacaca ctgcagtcat caatgtgctg gtattctgac ccctgagtgt 
181 gtgggctcag ttgtgtgttt gtttgtgtgt gtcatgtgaa cacatggtca gtagttgtgt 
241 ttgtgtcatg tgatcgcggg gtcagtagtt gtgtgtcgtg atcgcggggt tagtagttgc 
301 atgtgtgtgt tgtgtgacca gtgttgtccc acagggtcaa tagttgtgtg tgtatgccat 
361 gtgactgtgt ggtcaatagt tgtgcgtgta atgtgagcgc gaggtcagta tttttgtgtg 
421 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtt ggtcaaaatc ttcagcacgc gtccgtcttc 
481 agagcccaga aacactacag tgcggttctt atgaggccca gcagcagtgt ccaccacaat 
541 ctgagtcagc ttatacctgc tggcggtgtt ggtgatcagc ggttggctgt tgactgacgg 
601 cacgctctcc tccatcagcg ggtgtgtttt gatgaacgac agcacagagt ctggaaactg 
661 aacagaagag gaataggagg aggcagaacc ctcacctgca cacgaaccgt gggttgg 
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LOCUS (LOC): 
GenBank ACC. no. (gbn) 
GenBank version (ver) : 
cas registry no. (rn): 

SEQUENCE LENGTH (SQL): 

molecule type (ci): 

DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



SOURCE : 
ORGANISM (ORGN): 



GENBANK. RTM. COPYRIGHT 2004 on STN 

BM180699 GenBank (R) 
BM180699 

BM180699.1 Gl: 17404770 

376286-68-9 

561 

mRNA; linear 
Expressed sequence tag 
7 Dec 2001 

daj83c05.yl NICHD XGC Lil Xenopus laevis cDNA clone 
IMAGE: 5129457 5' similar to TR:035464 035464 

***SEMAPHORIN*** *** V IA*** PRECURSOR. ;, mRNA 

sequence. 

African clawed frog. 
Xenopus laevis 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Amphibia; Batrachia; Anura; 



Mesobatrachia; Pipoidea; Pipidae; xenopodinae; xenopus 
NUCLEIC ACID COUNT (NA) : 133 a 126 c 115 g 187 t 

COMMENT: 

Contact: Robert Strausberg, Ph.D. 
Emai 1 : cgapbs- r@mai 1 . ni h . gov 

Tissue Procurement: Martha Rebbert, Steven L. Klein, Ph.D. 
cDNA Library Preparation: Life Technologies, inc. 
cDNA Library Arrayed by: The I.M.A.G.E. consortium (LLNL) 
DNA Sequencing by: Washington University Genome Sequencing Center 
clone distribution: Xenopus clones from this library are available 
through the I.M.A.G.E. Consortium/LLNL at: info@image.llnl.gov 
Seq primer: -40RP from Gibco 
High quality sequence stop: 444. 
REFERENCE: 1 (bases 1 to 561) 

AUTHOR (AU): NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP), Tumor Gene Index 
JOURNAL (SO): Unpublished (1997) 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



1. .561 



SEQUENCE (SEQ): 
1 gctgcaagcc 
61 aatagaaagc 
121 gatttatagc 
181 gaaatttttc 
241 cgcatcccgg 
301 ttttcccaag 
361 aaaaacagtg 
421 atttctattt 
481 ttgtcttggc 
541 atgatatggc 



/organism="Xenopus laevis" 
/db-xref= M taxon:8355" 
/clone="IMAGE: 5129457" 
/clone-lib="NICHD XGC Lil" 
/lab-host="DHlOB 
(phage- resistant)' 1 
/note="organ: liver; Vector: 
PCMV-SP0RT6; site-1: Notl; site-2: 
sail; cloned unidi recti onally. 
Primer: oligo dT. Average insert 
size 1.4 kb. Constructed by Life 
Technologies. Note: This is a 
xenopus Gene Collection (XGC ) 
library. " 



tcctgcccct 
taagcagact 
tttgcttcaa 
ttttgttggc 
catttttaac 
agtcgcccag 
gacatctttc 
caacattttg 
tactttttca 
tgacatcgct 



ttgctgtgtt 
ttctgaaaat 
atggcacatt 
atttcttctt 
ctgatacaac 
gtgtgcaaaa 
ctaaaagctc 
caagctgtta 
acaccgtaca 
a 



ggagatagcg 
tatattttgc 
cttatcttcc 
aaatgatgtt 
aaaaccttgt 
atgacatggg 
gactgaattg 
cagacgtcat 
acagcatccc 



aatgtgcttt 
agacatgtga 
agaacacatg 
ggcatcttcg 
ttttttccct 
gggatctcag 
ctcggttcct 
tcatatcaat 
cggctctgct 



tgtctgtgga 
tcttccaggg 
gttctataat 
tatcccctag 
ctctaggtgg 
agggtacttg 
ggggattccc 
ggccgagacg 
gtttgtgcct 
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LOCUS (loc): 

GenBank ACC. NO. (GBN) 

GenBank VERSION (VER) 

CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 



SOURCE: 
ORGANISM (ORGN) 



BG816002 GenBank (R) 
BG816002 

BG816002.1 Gl: 14186982 
338672-75-6 
589 

mRNA ; linear 
Expressed sequence tag 
22 May 2001 

dad39a04.yl Wellcome CRC pCSl07 tropical is Stl0-12 
silurana tropical is cDNA clone IMAGE:4461607 5' similar 
to TR: 035464 035464 ***SEMAPHORIN*** *** VIA *** 
PRECURSOR. ;, mRNA sequence, 
western clawed frog, 
silurana tropical is 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Amphibia; Batrachia; Anura; 
Mesobatrachia; Pipoidea; Pipidae; Xenopodinae; silurana 
NUCLEIC ACID COUNT (NA) : 154 a 125 c 145 g 165 t 
COMMENT: 

Contact: Sandy Clifton, Ph.D. 
WashU Xenopus EST project, 1999 
Washington University school of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 
Tel : 314 286 1800 
Fax: 314 286 1810 



Email: est@watson.wustl.edu 

Library constructed by A. Zorn and J . Mason (Wellcome/CRC institute 
). DNA Sequencing by: Washington University Genome Sequencing 
Center 

clone distribution: Xenopus clones from this library are available 
through the i.M.A.G.E. Consortium/LLNL at: info@image.llnl.gov 
High quality sequence stop: 513 



REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO) : 

FEATURES (FEAT) : 
Feature Key 



1 (bases 1 to 589) 
clifton,S.; Johnson, S.L. ; Blumberg,B.; Song, J.; 
Hillier,L.; Pape t D., Martin, J. ; Wylie,T.; 
Underwood, K. ; Theising,B.; Bowers, Y.; Person,B.M; 
Gi bbons , M . ; Harvey , N . ; Ri tt e r , E . ; J ackson , Y . ; 
McCann,R.; Waterston , R. ; Wilson,R. 
WashU Xenopus EST project, 1999 
Unpublished (1999) 



Location 



Qualifier 



source 1..589 /organism="Silurana tropicalis" 

/db-xref="taxon:8364" 
/clone= ,, IMAGE:4461607" 
/clone-lib= n Wellcome CRC pCSl07 
tropicalis stlO-12" 
/tissue-type="whole embryo, stages 
10-12" 

/lab-host="DHlOB 
(phage- resistant)" 
/note="vector: pcsl07; Site-1: 
Notl; Site-2: EcoRI; cDNAs were 
oligo-dT primed and di rectionally 
cloned. Average insert size 1.5 
kb, range 0.5-4 kb. Library 
constructed by A. Zorn and J. 
Mason (Wellcome/CRC Institute). " 

SEQUENCE (SEQ): 

1 cccggggcct tgtttgtgtc tttctctggc tgcattgtca gactgccttt gagccgctgt 
61 gagcgatacg gatcatgcaa aaagtcttgt attggtgcaa gggatcctta ttgtggctgg 
121 ctgaaccatg agtcctgcgg aagagtgaaa ccgggcatgt tttttgggta tgaacaggat 
181 gtggaatacg gcaacacagt acagcttggc gactgtcatg acattttgcc tactgcaact 
241 acaccagatt acaaaatatt tggcgatcca acatctgaca ttgaagttcc ttctacttct 
301 gtaaccacca caagtagccc agttatttta cctaaagtga ttggttcctg gaaacccaga 
361 atgactggct cacggaaatt tatataccaa gatgacccaa acacctcaga gtatattgat 
421 ccgttaacag gtgtccaaaa gggtgtacgg tgggaagtac agtcaggaga cgccaaccaa 
481 atggtccata tgaatgtttt gatcacctgc gttcttgctg cttttgtgct cggagctttc 
541 atcgcaggag tgatagttta ctgctacagg gagatgtttg taaggaaaa 
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LOCUS (LOC): 
GenBank ACC. NO 
GenBank VERSION 
CAS REGISTRY NO 
SEQUENCE LENGTH 
MOLECULE TYPE 
DIVISION CODE 
DATE (DATE): 
DEFINITION (DEF) 



clone PLACE4000230, 
***semaphorin*** 



AK027867 GenBank (R) 
(GBN): AK027867 
(VER) : AK027867 .1 GI : 14042853 

(RN): 390638-15-0 
(SQL) : 6060 
(CI): mRNA ; linear 
(CI): primates 

30 Jan 2004 

Homo sapiens cDNA FL J 14961 fis, 
highly similar to Mus musculus 
***via*** mRNA. 
KEYWORDS (ST): oligo capping; fis (full insert sequence) 

SOURCE: Homo sapiens (human) 

organism (orgn) : Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini 
Hominidae; Homo 

COMMENT : 

nedo human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology; cDNA library construction, 
5'- & 3' -end one pass sequencing and clone selection: Helix 
Research Institute (supported by Japan Key Technology Center etc.) 
and Department of Virology, institute of Medical science, 
University of Tokyo. 
REFERENCE: 1 



AUTHOR (AU): 



TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



ota,T.; Suzuki ,Y.; Nishikawa,T. ; otsuki.T.; 
Sugiyama.T.; irie.R.; Wakamatsu,A. ; Hayashi.K.; 
Sato.H.; Nagai,K.; Kimura.K.; Makita,H.; Sekine,M.; 
obayashi ,M. ; Nishi,T.; shibahara,T. ; Tanaka.T.; 
Ishii,s.; Yamamoto, J.; saito,K.; Kawai,Y.; lsono,Y.; 
Nakamura,Y. ; Nagahari, K.; Murakami ,K.; Yasuda,T.; 
iwayanagi ,T. ; Wagatsuma,M. ; shiratori ,A. ; sudo,H.; 
Hosoiri,T.; Kaku f Y.; Kodaira,H.; Kondo,H.; sugawara.M.; 
Takahashi ,M. ; Kanda,K.; Yokoi,T.; Furuya,T.; 
Kikkawa,E.; Omura,Y.; Abe,K.; Kamihara,K. ; Katsuta,N.; 
Sato,K.; Tanikawa,M.; Yamazaki,M.; Ninomiya,K.; 
ishibashi ,T. ; Yamashita.H. ; Murakawa,K.; Fujimori, K.; 
Tanai,H.; Kimata,M.; Watanabe.M.; Hiraoka,s.; Chiba,Y. ; 
lshida,s.; Ono,Y.; Takiguchi ,s. ; Watanabe,s. 
Yosida,M.; Hotuta,T.; Kusano,J.; Kanehori,K. 
Takahashi -Fuji i , A. ; Hara,H.; Tanase,T.; Nomura, Y. ; 
Togiya,S.; Komai,F.; Hara,R.; Takeuchi,K.; Arita,M.; 
Imose,N.; Musashino,K. ; Yuuki,H.; Oshima,A.; Sasaki ,N. 
Aotsuka,s.; Yoshikawa, Y. ; Matsunawa,H. ; lchihara,T.; 
Shiohata,N. ; Sano,S.; Moriya,S.; Momiyama.H. ; Satoh,N. 
Takami,s.; Terashima, Y. ; Suzuki, 0.; Nakagawa,S.; 
senoh,A.; Mizoguchi , H . ; Goto,Y.; shimizu,F.; Wakebe,H. 
Hishigaki ,H. ; Watanabe.T.; Sugiyama,A.; Takemoto,M.; 
Kawakami , B . ; Yamazaki , M . ; Watanabe , K . ; Kumagai ,A. ; 
ltakura,s.; Fukuzumi,Y.; Fujimori ,Y. ; Komiyama,M.; 
Tashiro,H.; Tanigami,A.; Fujiwara,T. ; Ono.T.; 
Yamada,K.; Fujii,Y.; Ozaki.K.; Hirao,M.; ohmori f Y. ; 
Kawabata,A.; Hikiji,T.; Kobatake,N. ; lnagaki,H.; 
lkema,Y.; okamoto,S.; okitani,R.; Kawakami, T.; 
Noguchi.S.; Itoh,T.; Shigeta,K.; Senba,T. ; 
Matsumura.K. ; Nakajima, Y. ; Mizuno,T.; Morinaga,M.; 
Sasaki, M.; Togashi,T.; Oyama,M.; Hata,H.; watanabe, M.; 
Komatsu,T.; Mi zushima-Sugano , J . ; Satoh,T.; shirai,Y.; 
Takahashi ,Y. ; Nakagawa,K.; Okumura.K.; Nagase,T.; 
Nomura, N.; Kikuchi,H.; Masuho,Y.; Yamashita, R. ; 
Nakai.K.; Yada,T.; Nakamura,Y.; ohara,0.; Isogai,T.; 
sugano.s. 

Complete sequencing and characterization of 21,243 

full-length human cdnas 

Nat. Genet., 36 (1), 40-45 (2004) 

CA 140:158332 

2 

lsogai,T.; ota.T. ; Hayashi,K.; Sugiyama,T.; otsuki,T.; 
Suzuki, Y.; Nishikawa,T. ; Nagai,K.; sugano,S.; 
Takahashi -Fuji i ,A. ; Hara,H.; Tanase,T.; Nomura, Y. ; 
Toqiya,s.; Komai,F.; Hara,R.; Takeuchi.K.; Arita,M.; 
Nabekura,T.; Ishii,s.; Kawai,Y.; Saito,K.; Yamamoto, J.; 
Wakamatsu,A. ; Nakamura,Y. ; Nagahari, K.; Masuho,Y.; 
oshima,A. 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 6060) 

Isogai,T.; otsuki ,T. 

Direct submission 

submitted (10-MAY-2001) Takao isogai , Helix Research 
institute, Genomics Laboratory; 1532-3 Yana, Kisarazu, 
Chi ba 292-0812, Japan (E-mail :genomics@hri -co.jp, 
Tel =81-438-52-3975, Fax:81-438-52-3986) 



FEATURES (FEAT): 
Feature Key 



Location 
1. .6060 



Qualifier 



source 



CDS 



89. .1345 



/orqanism="Homo sapiens" 

/mol-type= ,, mRNA M 

/db-xref ="taxon : 9606" 

/cl one=" PLACE40002 30 " 

/tissue-type^'placenta" 

/clone-lib= M PLACE4" 

/note="cloning vector: pMEl8SFL3" 

/note="unnamed protein product' 1 

/codon-start=l 

/protei n-i d= M BAB55418 . 1" 

/db-xref ="Gl: 14042854" 

/translation="MlMNGTLYlAARDHIYTVDI 

DTSHTEEIYCSKKLTWKSRQADVD 

TCRMKGKHKDECHNFIKVLLKKNDDALFVCGTNA 



SEQUENCE (SEQ): 

1 caactataca aaacagtatc cggtgtttgt 
61 gaggcacagg ctggacatcc aggtgattat 
121 tagggaccat atttatactg ttgatataga 
181 caaaaaactg acatggaaat ctagacaggc 
241 acataaggat gagtgccaca actttattaa 
301 gtttgtctgt ggaactaatg ccttcaaccc 
361 ggaaccattc ggggatgaat tcagcggaat 
421 caacgttgca ctgtttgcag atggaaaact 
481 cattgacgca gtcatttacc ggagtcttgg 
541 cgattcaaaa tggttgaaag aaccatactt 
601 ctacttcttc ttcagggaaa tagcagtgga 
661 aagagtggct caggtttgta agaatgatat 
721 gtggacgtcg ttcctgaagg cgcgcttgaa 
781 tttcaacatt ctccaggcag ttacagatgt 
841 ggcaacgttt tctacacctt ataacagcat 
901 gcttgacatt gccagtgttt ttactgggag 
961 ctggacacca gttcctgatg aacgagttcc 
1021 atcctcctta gaaagatatg caacctccaa 
1081 caagacgcac ccgctcatgg atgaggcagt 
1141 gagaacaatg gtcagatacc gccttaccaa 
1201 tcagaatcac actgtggttt ttctgggatc 
1261 cagaatagga aatagtggtt ttctaaatga 
1321 caactctgaa aaaatgcagc tatgatggag 
1381 tggacagagc aagcagctct ctgtatgttg 
1441 ttggccggtg tgaacgacat gggaagtgta 
1501 tattgtggat ggataaagga aggtggtgcc 
1561 acttttgagc aggacataga gcgtggcaat 
1621 tttgtggcac tgaatgggca ttctagttcc 
1681 acggctcaag aggggtatga gtctagggga 
1741 tcacctgaca gcacagaccc tttgggggca 
1801 ggagtgattc gggaaagtta cctcaaaggc 
1861 gccattgcag tcatcctggc tttcgtcatg 
1921 tgcgtctgtg atcatcggcg caaagacgtg 
1981 acccactcgc gccggggctc catgagcagc 
2041 actcaatccg aagacccaaa gccggaggcc 
2101 ctcgccactc ccggcaacac ggccaagatg 
2161 ctgacggccc tccccacccc agagtcaacc 
2221 cgcggcagcc gcgagtggga gaggaaccag 
2281 ccccccatgg gctcccctgt gattcccacg 
2341 atccccagcg tggtggtcct gcccatcacg 
2401 cagcccaaaa tgagcgaggt ggcccagatg 
2461 tataagacca tcaaggaaca tctcagcagc 
2521 gagaacctgg acagcctgcc ccccaaagtt 
2581 ggagcctccc tgtctcagac cggtctaagc 
2641 tacggggttg actataagag gagctacccc 
2701 accactctca aaagaaacaa cactaactcc 
2761 agctttggca ggggagacaa cccgccgccc 
2821 cacagctccc agccatctgg ccaggccgtg 
2881 tacaactcac tgacaaggtc ggggctgaag 
2941 cccaaaccat cctttgctcc cctttccaca 
3001 tcccaggggg agggggtcag gtgtcgaacc 
3061 agcaaggttc tcaactgcct cgagtaccca 
3121 ggacgctggg tcctcctctc tgggacacag 
3181 tggtgaaggt ttgcaacggc ggggactcac 
3241 ccctacaaca ggtcggaccc acaaaagact 
3301 cacataccta ccccatcccc cacccccaca 
3361 acacatacac acacacgcac agaggtgaac 
3421 cgggacgtgg ccagactggg attgcgttcc 
3481 atcaagaaaa tttaaaaaga caaaaaaaaa 
3541 taacaatggc agaagtttac tatgcgcaaa 
3601 ttctgttgca gcagataaat gccatgttgg 
3661 ctcttttaat gaaataacgt gaccgttaac 



FNPSCRNYKMDTLEPFGDEFSGMA 

RCPYDAKHANVALFADGKLYSATVTDFLAIDAVI 

YRS LGES PTLRTVKHDSKWLKEPY 

FVQAVDYGDYIYFFFREIAVEYNTMGKWFPRVA 

QVCKNDMGGSQRVLEKQWTSFLKA 

RLNCSVPGDSHFYFNILQAVTDVIRINGRDWLA 

T FST P YN S I PGS AVCAYDM LD IAS 

VFTGRFKEQKSPDSTWTPVPDERVPKPRPGCCAG 

SSSLERYATSNEFPDDTLNFIKTH 

PLMDEAVPSIFNRPWFLRTMVRYRLTKIAVDTAA 

GPYQNHTWFLGSEKGIILKFLAR 

IGNSGFLNDSLFLEEMSVYNSEKMQL" 



gggccacaag ccaggacgga acaccacaca 
gatcatgaac ggaaccctct acattgctgc 
cacatcacac acggaagaaa tttattgtag 
cgatgtagac acatgcagaa tgaagggaaa 
agttcttcta aagaaaaacg atgatgcatt 
ttcctgcaga aactataaga tggatacatt 
ggccagatgc ccatatgatg ccaaacatgc 
atactcagcc acagtgactg acttccttgc 
agaaagccct accctgcgga ccgtcaagca 
tgttcaagcc gtggattacg gagattatat 
gtataacacc atgggaaagg tagttttccc 
gggaggatct caaagagtcc tggagaaaca 
ctgctcagtt cctggagact ctcattttta 
gattcgtatc aacgggcgtg atgttgtcct 
ccctgggtct gcagtctgtg cctatgacat 
attcaaggaa cagaagtctc ctgattccac 
taagcccagg ccaggttgct gtgctggctc 
tgagttccct gatgataccc tgaacttcat 
gccctccatc ttcaacaggc catggttcct 
aattgcagtg gacacagctg ctgggccata 
agagaaggga atcatcttga agtttttggc 
cagccttttc ctggaggaga tgagtgttta 
tcgaagacaa aaggatcatg ggcatgcagc 
cgttctctac ctgtgtgata aaggttcccc 
aaaaaaacct gtattgcctc cagagaccca 
tgcagccatt tatcacccaa cagcagactg 
acagatggtc tgggggactg tcacaattcc 
ctcttgccca gcacaaccac atcagattcg 
ggaatgctgg actggaagca tctgcttgac 
gtgtcttccc ataatcacca agacaagaag 
cacgaccagc tggttcccgt caccctcttg 
ggggccgtct tctcgggcat caccgtctac 
gctgtggtgc agcgcaagga gaaggagctc 
gtcaccaagc tcagcggcct ctttggggac 
atcctcacgc cactcatgca caacggcaag 
ctcattaaag cagactagca ccacctggac 
ccaacgctgc agcagaagcg gaagcccagc 
aacctcatca atgcctgcac aaaggacatg 
gacctgcccc tgcgggcctc ccccagccac 
cagcagggct accagcatga gtacgtggac 
gcgctggagg accaggccgc cacactggag 
aagagtccca accatggggt gaaccttgtg 
ccacagcggg aggcctccct gggtcccccg 
aagcggctgg aaatgcacca ctcctcttcc 
acgaactcgc tcacgagaag ccaccaggcc 
tccaattcct ctcacctctc cagaaaccag 
gccccgcaga gggtggactc catccaggtg 
actgtctcga ggcagcccag cctcaacgcc 
cgtacgccct cgctaaagcc ggacgtaccc 
tccatgaagc ccaatgatgc gtgtacataa 
agcaggcaag gcgaggtgcc cgctcagctc 
ccagaccaag aaggcctgcg gcagagccga 
gggtactcac gaaaactggg ccgcgtggtt 
cttcattctc ttccttcact ttcccccaca 
tcagttatca tcacaaacat gagccaaaag 
cacacacaca cacacacaca catgcacaca 
agaaactgaa acattttgtc cacaacttca 
aacctgcaaa acacaaatac attttttaaa 
agaattcatt gataattcta actcagactt 
tactgtgaaa tgcccgccag tgttacagct 
gcaactatgt catagatttc tgctcctcct 
gcaagtaact ctttatttat tgttcaccct 



3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 
5581 
5641 
5701 
5761 
5821 
5881 
5941 
6001 



ttttttcctt 
ccattgaaag 
tttttacaat 
gactgtctgg 
agttcattgc 
agtccttcca 
catatctcat 
gcttgcaata 
ggtgtgaact 
cttcgtgggg 
ttgagttgtg 
cagatattta 
ttggagtcaa 
gctgggcagc 
cctttggctg 
tcatcctcaa 
ctagcagaaa 
caggcggtgc 
cacgtgcagt 
gtgattaacg 
ctggggcaaa 
gaggtggtgt 
ttgtttactc 
ttgttgtttt 
atcttctggg 
tttgagttgg 
agtacttagc 
gtttaggagt 
ccatgttcac 
aatattgttt 
tgtatccccc 
cccatcattg 
ttaaggagac 
ctggtgttaa 
tccccagttt 
gctttccacc 
agttgtaaaa 
gattcacgac 
gaaaagcaag 



aaggaaagga 
ttaaactatt 
aaattcactg 
tctgtagcta 
ctggtcaggt 
gatcctaatt 
tgatgccaac 
tgccatccca 
aatatgctgg 
ttacatctgt 
tacacttgta 
tttaaccctc 
gcagatggaa 
accacaccct 
tgctgcgaag 
gagctccctg 
agggactaac 

gcggttgggc 

gcacacaaag 
agtaactcca 
ggatttttta 
gtgtttttct 
tatcatgaac 
tttaaacagt 
tttagttttg 
atagtgaaaa 
tgagggtggg 
aggtggttgt 
atcctgtgaa 
tcttccttga 
tgcccccttc 
acgtaagcgg 
ctgtgtcaaa 
ccctaaaata 
ttttccccac 
cttgctagac 
atattgtgat 
ttaatgtatg 
gaaaaggctc 



ctcttccaaa 
taacgtgaaa 
agtaaataag 
aaggaaaaag 
ctctggccta 
tctaaagcaa 
atggttttgt 
tttcaccttg 
ctgctacctt 
gaatagcctg 
aactctttat 
ccttccccac 
gaatgcagtg 
ccaattcaca 
catggacccc 
cttcccttag 
gtcccattcc 
agacacctgg 
aaatgacatg 
agtacaaggc 
ttttattttt 
cctcttggtt 
aaaggaactg 
atatttggtg 
tctttaaaaa 
agtggagcct 
tgagagccct 
tgttgttttt 
ctctgtctcc 
attttctcta 
tttcccatgc 
ctgtcttgtc 
acttacatcc 
cccacatgta 
tgtgtgtggg 
actggtagat 
gtcaccaatt 
ttgtaagaaa 
tttattactt 



tatcatccta 
tccattaact 
ttggagctgg 
ttaggagggg 
taaacatgca 
tcaggagaga 
ccatggttct 
catgtgagac 
gcataaatta 
ttttacacat 
gatcaactgt 
cctctgcccc 
gtgatagttg 
ctgccttcta 
agtgttgttg 
attatttcaa 
tcttttctgc 
gaggagtctc 
gaaatagatg 
cgaccacaat 
tattttgtta 
ttcatttgct 
tccacatact 
gtctctttgt 
gaattcaaaa 
ccataatcag 
ttgttccaaa 
actttttttt 
tgaaaccctg 
gaaatgcaga 
ttgactccca 
cagtgtttgt 
acattcctac 
ttgagctggt 

gggagggtcc 

gcaacaaact 
ttccttccat 
agaaaaaaga 
aaaagtaata 



tgaacagctc 
ggaataattg 
aattctgagc 
aatatttatt 
aaaaaactaa 
aagcttttag 
gactttctgt 
agcaaacaaa 
atgatttgat 
gtaaatttgt 
tccccctttt 
acttccagcc 
tcatgcgaca 
gttgtgccaa 
tgggtgtgtc 
tacggtgata 
tgcgtccact 
caagccatgt 
caggcaggct 
ggatgctgca 
attatgtttt 
caagcacaca 
gtaaaccatg 
ctgcttttat 
aaaagactga 
tgtggttgcc 
agtccattag 
tttggctttt 
aagtatttcc 
aattaggaag 
agagtactgg 
aggcatcact 
acccccacaa 
cttctgcatt 
ataaacccga 
cagatttata 
ctccacatcc 
aaagaaaaaa 
aaacctgact 



ttcagaaagc 
agtttcttta 
tttgtgtttg 
taaatctatc 
ttcatgttaa 
aatggcacat 
gaaggcacca 
atccacaaat 
cacacgggtt 
gccttacacc 
gaaataagtg 
ctctgaaaga 
gcctgagaac 
ctggaaccac 
aaatcccctt 
tccttatttg 
ggctagagag 
gcacagcaca 
ggtccctgct 
aaaacgttga 
taggggatgg 
aaagggactt 
agcagtgttg 
ttccagttcg 
caatgacagt 
ttcagacctg 
ttttgctgtt 
gtaatggaat 
tagaacctga 
gtgatgggtc 
cagtgcagca 
gcctcagctg 
cccatcactt 
taagtatttc 
gtgtgccttt 
tttgttgtaa 
cctaacatct 
agggaaaaaa 
gttctatatt 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) : 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



GenBank (R) 
Gl: 14042491 



KEYWORDS (ST): 
SOURCE : 
ORGANISM (ORGN): 



AK027654 
AK027654 
AK027654.1 
390635-08-2 
2306 

mRNA; linear 
Primates 
30 Jan 2004 

Homo sapiens cDNA FL J 14748 fis, clone NT2RP3002869 , 
highly similar to mus musculus ***semaphorin*** 

***via*** mRNA. 
oligo capping; fis (full insert sequence) 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

COMMENT: 

NEDO human cDNA sequencing project supported by Ministry of 
Economy, Trade and industry of Japan; CDNA full insert sequencing: 
Research Association for Biotechnology; cDNA library construction, 
5'- & 3'-end one pass sequencing and clone selection: Helix 
Research institute (supported by Japan Key Technology Center etc.) 
and Department of Virology, Institute of Medical Science, 
University of Tokyo. 
REFERENCE: 1 

author (AU): ota,T.; Suzuki, Y.; Nishikawa,T. ; otsuki,T.; 

sugiyama,T,; lrie,R.; Wakamatsu,A. ; Hayashi.K.; 

Sato,H.; Nagai,K.; Kimura,K.; Makita,H.; Sekine,M.; 

obayashi,M.; Nishi.T.; shibahara,T. ; Tanaka.T. ; 

lshii,s.; Yamamoto,J.; saito,K.; Kawai.Y.; lsono,Y.; 

Nakamura,Y.; Nagahari,K.; Murakami, K.; Yasuda,T.; 

Iwayanagi ,t. ; wagatsuma,M. ; shiratori ,A. ; sudo,H.; 

Hosoiri,T.; Kaku,Y.; Kodaira,H.; Kondo,H.; Sugawara,M.; 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



Takahashi ,M. ; Kanda,K.; Yokoi,T.; Furuya,T.; 
Kikkawa,E.; omura,Y.; Abe,K.; Kamihara,K. ; Katsuta,N.; 
sato,K. ; Tanikawa,M. ; Yamazaki ,M. ; Ninomiya,K. ; 
ishibashi ,T. ; Yamashita,H. ; Murakawa,K. ; Fujimori ,K. ; 
Tanai,H.; Kimata,M.; Watanabe f M.; Hiraoka,s.; chiba,Y. ; 
lshida,s.; Ono,Y.; Takiguchi ,S. ; Watanabe, s. 
Yosida,M.; Hotuta,T. ; Kusano,J.; Kanehori,K. 
Takahashi -Fuji i ,A. ; Hara,H.; Tanase,T.; Nomura f Y.; 
Togiya,S.; Komai,F.; Hara,R.; Takeuchi,K.; Arita,M.; 
lmose,N.; Musashino,K. ; Yuuki.H.; Oshima,A. ; Sasaki, N. 
Aotsuka,S.; Yoshikawa, Y. ; Matsunawa,H. ; lchihara,T.; 
shiohata,N.; sano,S.; Moriya,S.; Momiyama,H.; Satoh,N. 
Takami,s.; Terashima, Y. ; Suzuki, o.; Nakagawa.S.; 
Senoh.A.; Mizoguchi ,H . ; Goto,Y.; shimizu,F.; Wakebe,H. 
Hishigaki ,H. ; Watanabe,T.; Sugiyama,A. ; Takemoto.M.; 
Kawakami , B . ; Yamazaki , M . ; Watanabe , K . ; Kumagai , A . ; 
Itakura,s.; Fukuzumi.Y.; Fujimori, Y. ; Komiyama,M.; 
Tashiro,H.; Tanigami ,A. ; Fujiwara,T.; Ono,T.; 
Yamada,K.; Fujii.Y.; Ozaki.K.; Hirao,M.; ohmori,Y.; 
Kawabata.A.; Hikiji,T.; Kobatake,N.; Inagaki,H.; 
Ikema,Y.; okamoto,S.; okitani,R.; Kawakami J.; 
Noguchi,s.; ltoh,T.; shigeta,K.; Senba,T.; 
Matsumura,K. ; Nakajima,Y.; Mizuno,T.; Morinaga,M.; 
Sasaki ,M.; Togashi,T.; oyama,M.; Hata,H.; Watanabe, M.; 
Komatsu,T.; Mi zushi ma-Sugano , J . ; satoh,T.; shirai.Y.; 
Takahashi ,Y. ; Nakagawa,K.; Okumura,K.; Nagase,T.; 
Nomura, N.; Kikuchi,H.; Masuho,Y.; Yamashita,R. ; 
Nakai,K.; Yada,T. ; Nakamura,Y. ; 0hara,0.; Isogai ,T.; 
Sugano,s . 

Complete sequencing and characterization of 21,243 

full-length human cDNAs 

Nat. Genet., 36 (1), 40-45 (2004) 

CA 140:158332 

2 

Isogai, T.; ota,T.; Hayashi,K.; Sugiyama.T.; otsuki,T.; 

Suzuki , Y. ; Nishikawa,T. ; Nagai,K.; Sugano,s.; 

Shiratori ,A. ; sudo,H.; Wagatsuma,M. ; Hosoiri,T.; 

Kaku,Y.; Kodaira,H.; Kondo.H.; Sugawara,M.; 

Takahashi ,M. ; chiba,Y.; Ishida,s.; Murakawa,K.; ono,Y.; 

Takiguchi ,s. ; Watanabe, S.; Kimura,K.; Murakami, K.; 

-Ishii.s.; Kawai,Y.; Saito,K.; Yamamoto,J.; 

Wakamatsu,A. ; Nakamura.Y.; Nagahari,K.; Masuho,Y.; 

Ni nomi ya , K . ; Iwayanagi , T . 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 2306) 

Isogai ,T. ; otsuki ,T. 

Direct Submission 

Submitted (10-MAY-2001) Takao Isogai, Helix Research 
institute, Genomics Laboratory; 1532-3 Yana, Kisarazu, 
Chi ba 292-0812, Japan (E-mail : genomicsOhri .co.jp, 
Tel :81-438-52-3975, Fax:81-438-52-3986) 



FEATURES (FEAT) 
Feature Key 



Location 



Qua! ifier 



source 



1. .2306 



CDS 



109. .1833 



/organism="Homo sapiens" 

/moT-type= ,, mRNA ,, 

/db-xref="taxon:9606" 

/clone="NT2RP3002869" 

/cell-line= M NT2 M 

/eel 1 -type="teratocarci noma" 

/clone-lib="NT2RP3" 

/note="cloning vector: 

pMEl8SFL3-mRNA from NT2 neuronal 

precursor cells after 2-weeks 

retinoic acid (RA) induction 

/note="unnamed protein product" 

/codon-start=l 

/protei n-i d="BAB55269 . 1" 

/db-xref = n GI : 14042492 " 

/transl ation="MDEAVPSIFNRPWFLRTMVR 

CSYDGVEDKRIMGMQLDRASSSLY 

VAFSTCVIKVPLGRCERHGKCKKTCIASRDPYCG 

WIKEGGACSHLSPNSRLTFEQDIE 

RGNTDGLGDCHNS FVALNGHS SSL LPSTTTS DST 



AQEGYES RGGM LDWKH LLDS PDST 

DPLGAVSSHNHQDKKGVIRESYLKGHDQLVPVTL 

LAI AVI LAFVMGAVFSGITVYCVC 

DHRRKDVAWQRKEKELTHSRRGSMSSVTKLSGL 

FGDTQSKDPKPEAILTPLMHNGKL 

AT PGNTAKM L I KADQH H LD LTAL PTP EST PT LQQ 

KRK PS RGS R EWE RNQN L INACTKD 

MPPMGSPVIPTDLPLRASPSHIPSVWLPITQQG 

YQH E YVDQPKMS EVAQMALEDQAA 

TLEYKTIKEHLSSKS PNHGVN LVENLDS LPPKVP 

QREASLGPPGASLSQTGLSKRLEM 

HHSSSYGVDYKRSYPTNSLTRSHQATTLKRNNTN 

SSNSSHLSRNQSFGRGDNPPPAPQ 

RVD SIQVHSSQPS GQAVTVS RQPS LN AYN S LT RS 

GLKRTPSLKPDVPPKPSFAPLSTS MKPNDACT" 

SEQUENCE (SEQ): 

1 actaagccca ggccaggttg ctgtgctggc tcatcctcct tagaaagata tgcaacctcc 
61 aatgagttcc ctgatgatgc cctgaacttc atcaagacgc acccgctcat ggatgaggca 
121 gtgccctcca tcttcaacag gccatggttc ctgagaacaa tggtcagatg cagctatgat 
181 ggagtcgaag acaaaaggat catgggcatg cagctggaca gagcaagcag ctctctgtat 
241 gttgcgttct ctacctgtgt gataaaggtt ccccttggcc ggtgtgaacg acatgggaag 
301 tgtaaaaaaa cctgtattgc ctccagagac ccatattgtg gatggataaa ggaaggtggt 
361 gcctgcagcc atttatcacc caacagcaga ctgacttttg agcaggacat agagcgtggc 
421 aatacagatg gtctggggga ctgtcacaat tcctttgtgg cactgaatgg gcattccagt 
481 tccctcttgc ccagcacaac cacatcagat tcgacggctc aagaggggta tgagtctagg 
541 ggaggaatgc tggactggaa gcatctgctt gactcacctg acagcacaga ccctttgggg 
601 gcagtgtctt cccataatca ccaagacaag aagggagtga ttcgggaaag ttacctcaaa 
661 ggccacgacc agctggttcc cgtcaccctc ttggccattg cagtcatcct ggctttcgtc 
721 atgggggccg tcttctcggg catcaccgtc tactgcgtct gtgatcatcg gcgcaaagac 
781 gtggctgtgg tgcagcgcaa ggagaaggag ctcacccact cgcgccgggg ctccatgagc 
841 agcgtcacca agctcagcgg cctctttggg gacactcaat ccaaagaccc aaagccggag 
901 gccatcctca cgccactcat gcacaacggc aagctcgcca ctcccggcaa cacggccaag 
961 atgctcatta aagcagacca gcaccacctg gacctgacgg ccctccccac cccagagtca 
1021 accccaacgc tgcagcagaa gcggaagccc agccgcggca gccgcgagtg ggagaggaac 
1081 cagaacctca tcaatgcctg cacaaaggac atgcccccca tgggctcccc tgtgattccc 
1141 acggacctgc ccctgcgggc ctcccccagc cacatcccca gcgtggtggt cctgcccatc 
1201 acgcagcagg gctaccagca tgagtacgtg gaccagccca aaatgagcga ggtggcccag 
1261 atggcgctgg aggaccaggc cgccacactg gagtataaga ccatcaagga acatctcagc 
1321 agcaagagtc ccaaccatgg ggtgaacctt gtggagaacc tggacagcct gccccccaaa 
1381 gttccacagc gggaggcctc cctgggtccc ccgggagcct ccctgtctca gaccggtcta 
1441 agcaagcggc tggaaatgca ccactcctct tcctacgggg ttgactataa gaggagctac 
1501 cccacgaact cgctcacgag aagccaccag gccaccactc tcaaaagaaa caacactaac 
1561 tcctccaatt cctctcacct ctccagaaac cagagctttg gcaggggaga caacccgccg 
1621 cccgccccgc agagggtgga ctccatccag gtgcacagct cccagccatc tggccaggcc 
1681 gtgactgtct cgaggcagcc cagcctcaac gcctacaact cactgacaag gtcggggctg 
1741 aagcgtacgc cctcgctaaa gccggacgta ccccccaaac catcctttgc tcccctttcc 
1801 acatccatga agcccaatga tgcgtgtaca taatcccagg gggagggggt caggtgtcga 
1861 accagcaggc aaggcgaggt gcccgctcag ctcagcaagg ttctcaactg cctcgagtac 
1921 ccaccagacc aagaaggcct gcggcagagc cgaggacgct gggtcctcct ctctgggaca 
1981 caggggtact cacgaaaact gggccgcgtg gtttggtgaa ggtttgcaac ggcggggact 
2041 caccttcatt ctcttccttc actttccccc acaccctaca acaggtcgga cccacaaaag 
2101 acttcagtta tcatcacaaa catgagccaa aagcacatac ctaccccatc ccccaccccc 
2161 ccacacacac acacatgcac acaacacata cacacacacg cacagaggtg aacagaaact 
2221 gaaacatttt gtccacaact tcacgggacg tggccagact gggtttgcgt tccaacctgc 
2281 aaaacacaaa tacatttttt aaaatc 
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LOCUS (LOC): AK027501 GenBank (R) 

GenBank ACC. NO. (GBN) : AK027501 

GenBank VERSION (VER) : AK027501.1 GI: 14042222 

CAS REGISTRY NO. (RN) : 390634-60-3 

SEQUENCE LENGTH (SQL): 3041 

MOLECULE TYPE (ci) : mRNA ; linear 

DIVISION CODE (CI): Primates 

DATE (DATE): 30 Jan 2004 

DEFINITION (DEF): Homo sapiens cDNA FLJ14595 fis, clone NT2RM4002194, 

highly similar to Mus musculus ***semaphorin*** 
***Vla*** mRNA. 
KEYWORDS (ST): oligo capping; fis (full insert sequence) 

SOURCE: Homo sapiens (human) 

ORGANISM (orgn): Homo sapiens 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 



COMMENT: 

nedo human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology; cDNA library construction, 
5'- & 3' -end one pass sequencing and clone selection: Helix 
Research Institute (supported by Japan Key Technology Center etc.) 
and Department of Virology, Institute of Medical Science, 
University of Tokyo. 



reference: 

AUTHOR (AU) 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE 
REFERENCE: 

AUTHOR (AU) : 



(os): 



TITLE (TI): 
JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



FEATURES (FEAT): 
Feature Key 



1 

ota,T,; Suzuki, Y. ; Nishikawa,T. ; Otsuki,T.; 
Sugiyama,T.; lrie,R.; Wakamatsu , A. ; Hayashi,K.; 
Sato,H.; Nagai,K_; Kimura,K.; Makita,H.; sekine,M.; 
obayashi,M.; Nishi ,T. ; Shibahara,T. ; Tanaka,T.; 
lshii,s.; Yamamoto,J.; Saito,K.; Kawai,Y.; lsono,Y.; 
Nakamura.Y.; Nagahari,K.; Murakami, K.; Yasuda,T. ; 
iwayanagi ,T. ; Wagatsuma,M. ; Shi ratori , A. ; Sudo.H.; 
Hosoiri.T.; Kaku,Y.; Kodaira,H.; Kondo,H.; Sugawara,M. 
Takahashi ,M. ; Kanda,K.; Yokoi,T.; Furuya,T. ; 
Kikkawa,E.; 0mura,Y. ; Abe,K.; Kamihara,K.; Katsuta.N.; 
Sato,K.; Tanikawa,M.; Yamazaki, M.; Ninomiya,K.; 
Ishibashi ,T. ; Yamashita,H . ; Murakawa,K.; Fujimori, K.; 
Tanai,H.; Kimata,M.; Watanabe,M.; Hiraoka,s.; chiba,Y. 
Ishida,s.; ono,Y.; Takiguchi ,s . ; Watanabe,s. 
Yosida,M.; Hotuta,T. ; Kusano,J.; Kanehori,K. 
Takahashi -Fuji i ,A. ; Hara,H.; Tanase,T.; Nomura, Y. ; 
Togiya,s.; Komai,F.; Hara,R.; Takeuchi,K.; Arita,M.; 
lmose,N.; Musashino,K. ; Yuuki.H.; 0shima,A. ; Sasaki ,N. 
Aotsuka,S.; Yoshikawa,Y. ; Matsunawa,H. ; lchihara,T.; 
Shiohata,N.; Sano.S.; Moriya,s.; Momiyama,H.; Satoh,N. 
Takami,S.; Terashima,Y. ; Suzuki, o.; Nakagawa,s.; 
Senoh,A.; Mizoguchi ,H . ; Goto,Y.; shimizu,F.; Wakebe,H. 
Hishigaki ,H. ; watanabe,T.; sugiyama,A.; Takemoto,M.; 
Kawakami,B.; Yamazaki, M.; watanabe,K.; Kumagai.A.; 
Itakura,s.; Fukuzumi,Y.; Fujimori, Y. ; Komiyama,M.; 
Tashiro,H.; Tanigami,A.; Fujiwara,T.; Ono,T.; 
Yamada,K.; Fujii,Y.; Ozaki,K.; Hirao,M.; ohmori,Y.; 
Kawabata.A.; Hikiji,T.; Kobatake.N.; Inagaki,H.; 
lkema,Y.; okamoto,s.; 0kitani,R.; Kawakami,T.; 
Noguchi,s.; ltoh,T.; Shigeta,K.; Senba,T.; 
Matsumura,K. ; Nakajima,Y,; Mizuno,T.; Morinaga,M.; 
Sasaki f M.; Togashi.T.; oyama,M.; Hata,H.; watanabe,M.; 
Komatsu,T.; Mi zushima-sugano , J . ; satoh,T.; shirai,Y.; 
Takahashi ,Y. ; Nakagawa,K.; okumura,K.; Nagase,T.; 
Nomura, N . ; Kikuchi,H.; Masuho,Y.; Yamashita,R. ; 
Nakai,K.; Yada,T.; Nakamura,Y.; Ohara,o.; Isogai ,T.; 
Sugano,S . 

Complete sequencing and characterization of 21,243 

ful I -length human cDNAs 

Nat. Genet., 36 (1), 40-45 (2004) 

CA 140:158332 

2 

lsogai,T.; ota,T. ; Hayashi,K.; Sugiyama,T. ; Otsuki,T.; 

Suzuki, Y.; Nishikawa,T. ; Nagai,K.; Sugano,S.; 

Shi ratori , A. ; Sudo,H.; Wagatsuma,M. ; Hosoiri.T.; 

Kaku,Y.; Kodaira,H.; Kondo,H.; Sugawara,M.; 

Takahashi ,M. ; Chiba,Y.; lshida,s.; Murakawa,K. ; Ono,Y. 

Takiguchi ,S. ; watanabe,s.; Kimura,K.; Murakami, K.; 

Ishii,S.; Kawai,Y.; Saito,K.; Yamamoto,J.; 

Wakamatsu, A. ; Nakamura,Y.; Nagahari,K.; Masuho,Y.; 

Ni nomi ya , K . ; iwayanagi , T . 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 3041) 

Isogai ,T. ; Otsuki ,T. 

Direct Submission 

Submitted (10-MAY-2001) Takao isogai, Helix Research 
Institute, Genomics Laboratory; 1532-3 Yana, Kisarazu, 
Chiba 292-0812, Japan (E-mail :genomics@hri .co.jp, 
Tel =81-438-52-3975, Fax:81-438-52-3986) 



Location 



Qualifier 



source 



1. .3041 



/organism="Homo sapiens" 

/mol-type="mRNA" 

/db-xref= M taxon:9606" 



CDS 85.-2184 



/clone="NT2RM4002194" 

/cell-line="NT2" 

/eel 1 -type= ,f teratocarci noma" 

/clone-lib="NT2RM4" 

/note="cloning vector: 

pUCl9FL_3~mRNA from uninduced NT2 

neuronal precursor cells 

/note= M unnamed protein product" 

/codon-start=l 

/protei n-i d="BAB55158 . 1" 

/db-xref="Gl: 14042223" 

/transl ation="MLDIASVFTGRFKEQKSPDS 

TWTPVPDERVPKPRPGCCAGSSSL 

ERYATS N E F PDDTLN FIKTH P LMDEAVPS I FN RP 

WFLRTMVRYRLTKIAVDTAAGPYQ 

NHTWFLGSEKGIILKFLARIGNSGFLNDSLFLE 

EMSVYNSEKCSYDGVEDKRIMGMQ 

LDRASSSLYVAFSTCVIKVPLGRCERHGKCKKTC 

IAS RD P YCGWI K EGGACS H LS PN S 

RLTFEQDIERGNTDGLGDCHNSFVALNGHSSSLL 

PSTTTSDSTAQEGYESRGGMLDWK 

HLLDSPDSTDPLGAVSSHNHQDKKGVIRESYLKG 

H DQ LV PVT L LAI AV I L A FVM GAV F 

SGITVYCVCDHRRKDVAWQRKEKELTHSRRGSM 

SSVTKLSGLFGDTQSKDPKPEAIL 

T P LMH NGK L AT PGNTAKM L I KADQH H LD LTA L PT 

PESTPTLQQKRKPSRGSREWERNQ 

NLINACTKDMPPMGSPVIPTDLPLRASPSHIPSV 

W L P I TQQGYQH EYVDQPKM S EVA 

QMALEDQAATLEYKTIKEHLSSKSPNHGVNLVEN 

LDSLPPKVPQREASLGPPGASLSQ 

TGLSKRLEMHHSSSYGVDYKRSYPTNSLTRSHQA 

TTLKRNNTNSSNSSHLSRNQSFGR 

GDNPPPAPQRVDSIQVHSSQPSGQAVTVSRQPSL 

NAYNSLTRSGLKRTPSLKPDVPPK 

PSFAPLSTSMKPNDACT" 



SEQUENCE (SEQ) : 

1 cgtatcaacg ggcatgatgt tgtcctggca aegttttcta caccttataa cagcatccct 

61 gggtctgcag tctgtgccta tgacatgett gaeattgeca gtgtttttac tgggagattc 

121 aaggaacaga agtctcctga ttccacctgg acaccagttc ctgatgaacg agttcctaag 

181 cccaggccag gttgctgtgc tggctcatcc tccttagaaa gatatgeaac ctccaatgag 

241 ttccctgatg ataccctgaa cttcatcaag acgcacccgc tcatggatga ggcagtgccc 

301 tccatcttca acaggecatg gttcctgaga acaatggtca gataccgcct taccaaaatt 

361 gcagtggaca cagctgetgg gecatatcag aatcacactg tggtttttct gggatcagag 

421 aagggaatca tcttgaagtt tttggccaga ataggaaata gtggttttct aaatgacagc 

481 cttttcctgg aggagatgag tgtttacaac tctgaaaaat gcagctatga tggagtcgaa 

541 gacaaaagga tcatgggcat gcagctggac agagcaagca gctctctgta tgttgcgttc 

601 tctacctgtg tgataaaggt tccccttggc cggtgtgaac gacatgggaa gtgtaaaaaa 

661 acctgtattg cctccagaga cccatattgt ggatggataa aggaaggtgg tgcctgcagc 

721 catttatcac ccaacagcag actgactttt gagcaggaca tagagcgtgg caatacagat 

781 ggtctggggg actgtcacaa ttcctttgtg gcactgaatg ggcattccag ttccctcttg 

841 cccagcacaa ccacatcaga ttcgaegget caagaggggt atgagtctag gggaggaatg 

901 ctggactgga ageatctget tgactcacct gacagcacag accctttggg ggcagtgtct 

961 tcccataatc accaagacaa gaagggagtg attegggaaa gttacctcaa aggccacgac 

1021 cagctggttc ccgtcaccct ettggecatt gcagtcatcc tggctttcgt catgggggee 

1081 gtcttctegg gcatcaccgt ctactgcgtc tgtgatcatc ggegcaaaga cgtggctgtg 

1141 gtgeagegea aggagaagga gctcacccac tcgcgccggg gctccatgag cagcgtcacc 

1201 aagctcagcg gectctttgg ggacactcaa tccaaagacc caaagcegga ggccatcctc 

1261 acgccactca tgeacaaegg caagctcgcc actcccggca acacggccaa gatgetcatt 

1321 aaagcagacc agcaccacct ggacctgacg gccctcccca ccccagagtc aaccccaacg 

1381 ctgeagcaga ageggaagee cagccgcggc agecgegagt gggagaggaa ccagaacctc 

1441 ateaatgect gcacaaagga catgcccccc atgggctccc ctgtgattcc cacggacctg 

1501 cccctgcggg cctcccccag ccacatcccc agcgtggtgg tcctgcccat cacgcagcag 

1561 ggctaccagc atgagtacgt ggaccagccc aaaatgagcg aggtggccca gatggcgctg 

1621 gaggaccagg ccgccacact ggagtataag accatcaagg aacatctcag cagcaagagt 

1681 cccaaccatg gggtgaacct tgtggagaac ctggacagcc tgccccccaa agttccacag 

1741 egggaggect ccctgggtcc cccgggagcc tccctgtctc agaceggtet aagcaagegg 

1801 ctggaaatgc accactcctc ttcctacggg gttgactata agaggagcta ccccacgaac 

1861 tcgctcacga gaagccacca ggccaccact ctcaaaagaa acaacactaa ctcctccaat 

1921 tcctctcacc tctccagaaa ccagagcttt ggcaggggag acaacccgcc gcccgccccg 

1981 cagagggtgg actccatcca ggtgcacagc tcccagccat ctggccaggc cgtgactgtc 

2041 tcgaggcagc ccagcctcaa cgcctacaac tcactgacaa ggtegggget gaagcgtacg 

2101 ccctcgctaa ageeggaegt accccccaaa ccatcctttg ctcccctttc cacatccatg 

2161 aagcccaatg atgcgtgtac ataatcccag ggggaggggg tcaggtgtcg aaccagcagg 

2221 caaggegagg tgcccgctca gctcagcaag gttctcaact gectcgagta cccaccagac 



2281 caagaaggcc tgcggcagag ccgaggacgc tgggtcctcc tctctgggac acaggggtac 

2341 tcacgaaaac tgggccgcgt ggtttggtga aggtttgcaa cggcggggac tcaccttcat 

2401 tctcttcctt cactttcccc cacaccctac aacaggtcgg acccacaaaa gacttcagtt 

2461 atcatcacaa acatgagcca aaagcacata cctaccccat cccccacccc cccacacaca 

2521 cacacatgca cacaacacat acacacacac gcacagaggt gaacagaaac tgaaacattt 

2581 tgtccacaac ttcacgggac gtggccagac tgggtttgcg ttccaacctg caaaacacaa 

2641 atacattttt taaaatcaag aaaatttaaa aagacaaaaa aaaaagaatt cattgataat 

2701 tctaactcag actttaacaa tggcagaagt ttactatgcg caaatactgt gaaatgcccg 

2761 ccagtgttac agctttctgt tgcagcagat aaatgccatg ttgggcaact atgtcataga 

2821 tttctgctcc tcctctcttt taatgaaata acgtgaccgt taacgcaagt aactctttat 

2881 ttattgttca cccttttttt ccttaaggaa aggactcttc caaatatcat cctatgaaca 

2941 gctcttcaga aagcccattg aaagttaaac tatttaacgt gaaatccatt aactggaata 
3001 attgagtttc tttattttta caataaattc actgagtaaa t 

L4 ANSWER 62 OF 86 GENBANK . RTM . COPYRIGHT 2004 on STN 

LOCUS (LOC): AK027471 GenBank (R) 

GenBank ACC. NO. (GBN) : AK027471 

GenBank VERSION (VER) : AK027471.1 GH14042170 

CAS REGISTRY NO. (RN) : 390636-83-6 

SEQUENCE LENGTH (SQL): 2293 

molecule TYPE (CI) : mRNA; linear 
DIVISION CODE (CI): Primates 
DATE (DATE): 30 Jan 2004 

DEFINITION (DEF): Homo sapiens cDNA FLJ14565 fis, clone NT2RM4000233, 

highly similar to Mus musculus ***semaphorin*** 
***Vla*** mRNA. 
KEYWORDS (ST): oligo capping; fis (full insert sequence) 

SOURCE: Homo sapiens (human) 

ORGANISM (ORGN): Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

COMMENT : 

NEDO human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology; cDNA library construction, 
5'- & 3'-end one pass sequencing and clone selection: Helix 
Research Institute (supported by Japan Key Technology center etc.) 
and Department of virology, Institute of Medical Science, 
University of Tokyo. 
REFERENCE: 1 

AUTHOR (AU): ota,T. ; Suzuki ,Y. ; Nishikawa,T. ; otsuki,T.; 

Sugiyama,T.; lrie,R.; Wakamatsu ,A. ; Hayashi,K.; 

Sato,H.; Nagai,K.; Kimura,K.; Makita,H.; Sekine.M.; 

Obayashi.M.; Nishi,T.; shibahara,T. ; Tanaka,T.; 

lshii,S.; Yamamoto,J.; Saito,K.; Kawai.Y.; lsono,Y.; 

Nakamura,Y.; Nagahari,K.; Murakami ,K.; Yasuda,T.; 

iwayanagi ,T. ; Wagatsuma.M. ; Shi ratori ,A. ; sudo,H.; 

Hosoiri,T.; Kaku,Y.; Kodaira,H.; Kondo,H.; Sugawara,M.; 

Takahashi ,M. ; Kanda,K. ; Yokoi,T.; Furuya,T.; 

Kikkawa,E.; Omura,Y.; Abe,K.; Kamihara,K.; Katsuta,N.; 

Sato,K.; Tanikawa,M. ; Yamazaki,M.; Ninomiya.K.; 

ishibashi ,T. ; Yamashita.H. ; Murakawa,K.; Fujimori, K.; 

Tanai,H.; Kimata.M.; Watanabe,M.; Hiraoka,S.; Chiba,Y.; 

lshida,S.; Ono,Y.; Takiguchi ,s. ; Watanabe,S.; 

Yosida,M.; Hotuta,T.; Kusano,J.; Kanehori,K.; 

Takahashi -Fuji i ,A. ; Hara,H.; Tanase,T.; Nomura, Y. ; 

Togiya,s.; Komai,F.; Hara,R.; Takeuchi ,K. ; Arita,M.; 

lmose,N.; Musashino,K. ; Yuuki,H.; Oshima,A. ; Sasaki, N. 

Aotsuka,S.; Yoshikawa, Y. ; Matsunawa,H. ; Ichihara.T.; 

Shiohata,N.; sano,s.; Moriya,S.; Momiyama,H.; Satoh,N. 

Takami,s.; Terashima,Y. ; Suzuki ,0.; Nakagawa,s.; 

Senoh,A.; Mizoguchi ,H . ; Goto,Y.; Shimizu.F.; Wakebe,H. 

Hishigaki , H. ; Watanabe ,T.; Sugiyama,A. ; Takemoto,M.; 

Kawakami , B . ; Yamazaki , M . ; Watanabe , K . ; Kumagai , A . ; 

Itakura.s.; Fukuzumi,Y.; Fujimori ,Y.; Komiyama,M.; 

Tashiro,H.; Tanigami,A.; Fujiwara,T.; Ono,T.; 

Yamada,K.; Fujii,Y.; Ozaki,K.; Hirao,M.; Ohmori,Y.; 

Kawabata,A.; Hikiji,T.; Kobatake,N.; lnagaki,H.; 

Ikema,Y.; okamoto,S.; Okitani,R.; Kawakami, T.; 

Noguchi,s.; ltoh,T.; Shigeta,K.; Senba.T.; 

Matsumura,K. ; Nakajima,Y. ; Mizuno,T.; Morinaga,M.; 

Sasaki, M.; Togashi,T.; Oyama,M.; Hata,H.; Watanabe, M.; 

Komatsu,T. ; Mizushima-sugano , J . ; Satoh f T. ; shirai f Y. ; 

Takahashi ,y. ; Nakagawa,K.; okumura,K.; Nagase,T.; 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO) 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO) 



Nomura, N.; Kikuchi,H.; Masuho,Y.; Yamashita,R. ; 
Nakai,K.; Yada,T.; Nakamura,Y. ; ohara.O.; lsogai,T.; 
Sugano,S. 

Complete sequencing and characterization of 21,243 

full-length human cDNAs 

Nat. Genet., 36 (1), 40-45 (2004) 

CA 140:158332 

2 

lsogai,T.; ota,T.; Hayashi,K.; sugiyama,T.; otsuki,T.; 

Suzuki, Y.; Nishikawa,T. ; Nagai,K,; Sugano,s.; 

Shiratori ,A. ; Sudo.H.; Wagatsuma, M. ; Hosoiri,T.; 

Kaku , Y . ; Kodai ra , H . ; Kondo , H . ; Sugawara , M . ; 

Takahashi ,M. ; chiba,Y.; Ishida,s.; Murakawa,K.; Ono,Y.; 

Takiguchi ,S. ; Watanabe,S.; Kimura,K.; Murakami, K.; 

Ishii»s.; Kawai,Y.; Saito,K.; Yamamoto,J.; 

Wakamatsu,A. ; Nakamura,Y.; Nagahari,K.; Masuho,Y.; 

Ninomiya,K. ; Iwayanagi ,T. 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 2293) 

lsogai,T.; Otsuki,T. 

Direct Submission 

Submitted (10-MAY-2001) Takao Isogai , Helix Research 
Institute, Genomics Laboratory; 1532-3 Yana, Kisarazu, 
chiba 292-0812, Japan (E-mail :genomics@hri .co.ip. 
Tel =81-438-52-3975, Fax:81-438-52-3986) 



FEATURES (FEAT): 

Feature Key Location 

source 1. .2293 



Qual ifier 



CDS 



138. .1826 



/organism= n Homo sapiens" 

/mol-type= ,, mRNA" 

/db-xref ="taxon : 9606" 

/clone="NT2RM4000233 M 

/cell-line="NT2 M 

/eel 1 -type="teratocarci noma" 

/cl one-1 i b= M NT2RM4" 

/note="cloning vector: 

pUCl9FL3-mRNA from uninduced NT2 

neuronal precursor cells 

/note= M unnamed protein product" 

/codon-start=l 

/protei n-i d="BAB55136 . 1" 

/db-xref="GI : 14042171" 

/trans! ation="MSVYNSEKCSYDGVEDKRIM 

GMQLDRASSSLYVAFSTCVIKVPL 

GRCERHGKCKKTCIASRDPYCGWIKEGGACSHLS 

PNSRLTFEQDIERGNTDGLGDCHN 

S FVALNGHSSS LL PSTTTSDSTAQEGYESRGGML 

DWKHLLDSPDSTDPLGAVSSHNHQ 

DKKGVIRESYLKGHDQLVPVTLLAIAVILAFVMG 

AVFSGITVYCVCDHRRKDVAWQR 

KEKELTHSRRGSMSSVTKLSGLFGDTQSKDPKPE 

AILTPLMHNGKLATPGNTAKMLIK 

ADQHHLDLTALPTPESTPTLQQKRKPSRGSREWE 

RNQNLINACTKDMPPMGSPVIPTD 

LPLRASPSHIPSVWLPITQQGYQHEYVDQPKMS 

EVAQMALEDQAATLEYKTIKEHLS 

SKSPNHGVN LVENLDSLPPKVPQREASLGPPGAS 

LSQTGLSKRLEMHHSSSYGVDYKR 

SYPTNSLTRSHQATTLKRNNTNSSNSSHLSRNQS 

FGRGDNPPPAPQRVDSIQVHSSQP 

SGQAVTVSRQPSLNAYNSLTRSGLKRTPSLKPDV 

PPKPSFAPLSTSMKPNDACT" 



SEQUENCE (SEQ): 

1 ttgcagtgga 
61 agaagggaat 
121 gccttttcct 
181 aagacaaaag 
241 tctctacctg 
301 aaacctgtat 
361 gecatttate 
421 atggtctggg 
481 tgcccagcac 
541 tgctggactg 



cacagctgct 
catcttgaag 
ggaggagatg 
gatcatgggc 
tgtgataaag 
tgcctccaga 
acccaacagc 
ggactgtcac 
aaccacatca 
gaagcatttg 



gggecatate 
tttttggcca 
agtgtttaca 
atgcagctgg 
gttccccttg 
gacccatatt 
agactgactt 
aattcctttg 
gattcgaegg 
cttgactcac 



agaatcacac 
gaataggaaa 
actctgaaaa 
acagagcaag 
gccggtgtga 
gtggatggat 
ttgagcagga 
tggcactgaa 
ctcaagaggg 
ctgacagcac 



tgtggttttt 
tagtggtttt 
atgeagctat 
cagctctctg 
acgacatggg 
aaaggaaggt 
catagagegt 
tgggcattcc 
gtatgagtct 
agaccctttg 



ctgggatcag 
ctaaatgaca 
gatggagtcg 
tatgttgcgt 
aagtgtaaaa 
ggtgcctgca 
ggcaatacag 
agttccctct 
aggggaggaa 
ggggcagtgt 



601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 



cttcccataa 
accagctggt 
ccgtcttctc 
tggtgcagcg 
ccaagctcag 
tcacgccact 
ttaaagcaga 
cgctgcagca 
tcatcaatgc 
tgcccctgcg 
agggctacca 
tggaggacca 
gtcccaacca 
agcgggaggc 
ggctggaaat 
actcgctcac 
attcctctca 
cgcagagggt 
tctcgaggca 
cgccctcgct 
tgaagcccaa 
ggcaaggcga 
accaagaagg 
actcacgaaa 
attctcttcc 
ttatcatcac 
cacacacatg 
tttgtccaca 
aaatacattt 



tcaccaagac 
tcccgtcacc 
gggcatcacc 
caaggagaag 
cggcctcttt 
catgcacaac 
ccagcaccac 
gaagcggaag 
ctgcacaaag 
ggcctccccc 
gcatgagtac 
ggccgccaca 
tggggtgaac 
ctccctgggt 
gcaccactcc 
gagaagccac 
cctctccaga 
ggactccatc 
gcccagcctc 
aaagccggac 
tgatgcgtgt 
ggtgcccgct 
cctgcggcag 
actgggccgc 
ttcactttcc 
aaacatgagc 
cacacaacac 
acttcacggg 
ttt 



aagaagggag 
ctcttggcca 
gtctactgcg 
gagctcaccc 
ggggacactc 
ggcaagctcg 
ctggacctga 
cccagccgcg 
gacatgcccc 
agccacatcc 
gtggaccagc 
ctggagtata 
cttgtggaga 
cccccgggag 
tcttcctacg 
caggccacca 
aaccagagct 
caggtgcaca 
aacgcctaca 
gtacccccca 
acataatccc 
cagctcagca 
agccgaggac 
gtggtttggt 
cccacaccct 
caaaagcaca 
atacacacac 
acgtggccag 



tgattcggga 
ttgcagtcat 
tctgtgatca 
actcgcgccg 
aatccaaaga 
ccactcccgg 
cggccctccc 
gcagccgcga 
ccatgggctc 
ccagcgtggt 
ccaaaatgag 
agaccatcaa 
acctggacag 
cctccctgtc 
gggttgacta 
ctctcaaaag 
ttggcagggg 
gctcccagcc 
actcactgac 
aaccatcctt 

agggggaggg 

aggttctcaa 
gctgggtcct 
gaaggtttgc 
acaacaggtc 
tacctacccc 
acgcacagag 
actgggtttg 



aagttacctc 
cctggctttc 
tcggcgcaaa 
gggctccatg 
cccaaagccg 
caacacggcc 
caccccagag 
gtgggagagg 
ccctgtgatt 
ggtcctgccc 
cgaggtggcc 
ggaacatctc 
cctgcccccc 
tcagaccggt 
taagaggagc 
aaacaacact 
agacaacccg 
atctggccag 
aaggtcgggg 
tgctcccctt 
ggtcaggtgt 
ctgcctcgag 
cctctctggg 
aacggcgggg 
ggacccacaa 
atcccccacc 
gtgaacagaa 
cgttccaacc 



aaaggccacg 
gtcatggggg 
gacgtggctg 
agcagcgtca 
gaggccatcc 
aagatgctca 
tcaaccccaa 
aaccagaacc 
cccacggacc 
atcacgcagc 
cagatggcgc 
agcagcaaga 
aaagttccac 
ctaagcaagc 
taccccacga 
aactcctcca 
ccgcccgccc 
gccgtgactg 
ctgaagcgta 
tccacatcca 
cgaaccagca 
tacccaccag 
acacaggggt 
actcaccttc 
aagacttcag 
cccccacaca 
actgaaacat 
tgcaaaacac 
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LOCUS (LOC): AK027439 
GenBank ACC. NO. (GBN) : AK027439 
GenBank VERSION (VER) : AK027439 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) 



GenBank (R) 



1 Gl: 14042113 
390631-17-1 
2123 

mRNA; linear 
Primates 
30 Jan 2004 

Homo sapiens cDNA FLJ14533 fis, clone NT2RM2000407 , 
moderately similar to Mus musculus ***semaphorin*** 

***Vla*** mRNA. 
oligo capping; fis (full insert sequence) 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

COMMENT : 

NEDO human cDNA sequencing project supported by Ministry of 
Economy, Trade and Industry of Japan; cDNA full insert sequencing: 
Research Association for Biotechnology; cDNA library construction, 
5'- & 3'-end one pass sequencing and clone selection: Helix 
Research Institute (supported by Japan Key Technology Center etc.) 
and Department of virology, Institute of Medical Science, 
university of Tokyo. 



DEFINITION (DEF) : 



KEYWORDS (ST): 

source: 

ORGANISM (ORGN) 



REFERENCE: 
AUTHOR 



(AU) 



ota,T. ; Suzuki, Y.; Nishikawa,T. ; otsuki,T.; 
Sugiyama,T.; lrie,R.; Wakamatsu ,A. ; Hayashi,K.; 
Sato,H.; Nagai,K.; Kimura,K.; Makita,H.; Sekine,M.; 
obayashi,M.; Nishi,T.; shibahara,T. ; Tanaka,T. ; 
lshii,s.; Yamamoto,J.; Saito,K.; Kawai,Y.; lsono,Y.; 
Nakamura,Y.; Nagahari,K.; Murakami, K.; Yasuda,T.; 
Iwayanagi ,T. ; Wagatsuma,M. ; Shi ratori , A. ; sudo,H.; 
Hosoiri,T.; Kaku,Y.; Kodaira,H.; Kondo,H.; sugawara,M. 
Takahashi ,M. ; Kanda.K.; Yokoi,T.; Furuya,T.; 
Kikkawa,E.; 0mura,Y. ; Abe,K.; Kamihara,K.; Katsuta,N.; 
Sato,K.; Tanikawa,M. ; Yamazaki,M.; Ninomiya,K.; 
Ishibashi ,T. ; Yamashita,H. ; Murakawa,K.; Fujimori, K.; 
Tanai,H.; Kimata,M.; Watanabe,M.; Hiraoka,s.; chiba,Y. 
Ishida.s.; Ono,Y.; Takiguchi ,s. ; Watanabe,s.; 
Yosida,M.; Hotuta,T. ; Kusano,J.; Kanehori,K.; 
Takahashi -Fujii , A. ; Hara,H.; Tanase,T.; Nomura, Y. ; 
Togiya,s.; Komai,F.; Hara,R.; Takeuchi,K.; Arita,M. ; 
Imose,N.; Musashino,K. ; Yuuki,H.; oshima f A.; Sasaki ,N. 



TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI) : 
JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



Aotsuka,S.; Yoshikawa,Y. ; Matsunawa,H. ; lchihara,T.; 
shiohata,N. ; Sano,s.; Moriya.S.; Momi yama , H . ; Satoh.N 
TakamijS.; Terashima, Y. ; Suzuki ,0.; Nakagawa,S.; 
senoh,A.; Mizoguchi ,H. ; Goto,Y.; shimizu,F.; Wakebe.H 
Hishigaki ,H. ; Watanabe,T.; sugiyama,A. ; Takemoto , M . ; 
Kawakami , B . ; Yamazaki , M . ; Wat anabe , K . ; Kumagai , A . ; 
Itakura,S.; Fukuzumi,Y.; Fujimori, Y.; Komiyama,M.; 
Tashiro,H.; Tanigami ,A. ; Fujiwara,T.; ono,T.; 
Yamada,K.; Fujii,Y.; ozaki,K.; Hirao.M.; ohmori,Y.; 
Kawabata,A.; Hikiji,T.; Kobatake,N.; Inagaki,H.; 
Ikema,Y.; okamoto , s . ; Okitani,R.; Kawakami ,T.; 
Noguchi,s.; ltoh,T.; Shigeta,K.; Senba,T.; 
Matsumura,K. ; Nakajima,Y. ; Mizuno,T.; Morinaga,M.; 
Sasaki, M.; Togashi.T.; 0yama,M.; Hata,H.; Watanabe,M. 
Komatsu,T.; Mizushima-Sugano, J . ; satoh,T.; Shirai,Y.; 
Takahashi ,Y. ; Nakagawa,K.; okumura,K.; Nagase,T.; 
Nomura, N.; Kikuchi,H.; Masuho,Y.; Yamashita,R. ; 
Nakai ,K. ; Yada , T . ; Nakamu ra , Y . ; ohara , 0 . ; Isogai ,T. ; 
sugano,s. 

Complete sequencing and characterization of 21,243 

full-length human cDNAs 

Nat. Genet., 36 (1), 40-45 (2004) 

CA 140:158332 

2 

Isogai, T. ; ota,T. ; Hayashi,K.; Sugiyama.T.; Otsuki,T. 

Suzuki, Y.; Nishikawa,T. ; Nagai,K.; Sugano,s.; 

shiratori ,A. ; Sudo,H.; Wagatsuma,M. ; Hosoiri,T.; 

Kaku,Y.; Kodaira.H.; Kondo,H.; sugawara,M.; 

Takahashi ,M. ; Chiba,Y.; lshida,S.; Murakawa,K.; Ono,Y 

Takiguchi ,S. ; Watanabe,s.; Kimura,K.; Murakami, K.; 

Ishii,S.; Kawai,Y.; Saito,K.; Yamamoto,J.; 

Wakamatsu,A. ; Nakamura,Y.; Nagahari,K.; Masuho,Y.; 

Ninomiya,K. ; Iwayanagi ,T. 

NEDO human cDNA sequencing project 

Unpublished 

3 (bases 1 to 2123) 

Isogai J.; otsuki,T. 

Direct submission 

Submitted (10-MAY-2001) Takao Isogai, Helix Research 
institute, Genomics Laboratory; 1532-3 Yana, Kisarazu 
Chiba 292-0812, Japan (E-mail :genomics@hri .co.jp, 
Tel :81-438-52-3975, Fax:81-438-52-3986) 



FEATURES (FEAT): 

Feature Key Location 

source 1..2123 



CDS 



127. .1650 



Qualifier 

/orqanism="Homo sapiens" 

/mol-type="mRNA" 

/db-xref="taxon:9606" 

/clone="NT2RM2000407" 

/cell-line="NT2" 

/eel 1 -type="teratocarci noma" 

/clone-lib="NT2RM2" 

/note="cloning vector: 

pUC19FL3~mRNA from uninduced NT2 

neuronal precursor cells 

/note="unnamed protein product" 

/codon-start=l 

/protein-id="BAB55111.1 M 

/db-xref ="GI : 14042114" 

/transl ation="MSVYNSEKCSYDGVEDKRIM 

GMQLDRASSSLYVAFSTCVIKVPL 

GRCE RH GKCKKTCIAS RDPYCGWI K EGGACS H LS 

PNSRLTFEQDIERGNTDGLGDCHN 

SFVALNGVIRESYLKGHDQLVPVTLLAIAVILAF 

VMGAVFSGITVYCVCDHRRKDVAV 

VQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKDP 

KPEAILTPLMHNGKLATPGNTAKM 

LIKADQHHLDLTALPTPESTPTLQQKRKPSRGSR 

EWE RNQN L I NACTKDM P PMGS PVI 

PTDLPLRASPSHI PS VW L P ITQQG YQH E YVDQ P 

KM S EVAQMA L E DQAAT L E YKTI K E 

HLSSKSPNHGVNLVENLDSLPPKVPQREASLGPP 

GASLSQTGLSKRLEMHHSSSYGVD 

YKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLSR 

NQSFGRGDNPPPAPQRVDSIQVHS 



SQPSGQAVTVSRQPS LNAYNS LTRSGLKRTPS LK 
PDVPPKPSFAPLSTSMKPNDACT" 



SEQUENCE (SEQ): 

1 aaaaaaaaaa aaacatatca gaatcacact gtggtttttc tgggatcaga gaagggaatc 
61 atcttgaagt ttttggccag aataggaaat agtggttttc taaatgacag ccttttcctg 
121 gaggagatga gtgtttacaa ctctgaaaaa tgcagctatg atggagtcga agacaaaagg 
181 atcatgggca tgcagctgga cagagcaagc agctctctgt atgttgcgtt ctctacctgt 
241 gtgataaagg ttccccttgg ccggtgtgaa cgacatggga agtgtaaaaa aacctgtatt 
301 gcctccagag acccatattg tggatggata aaggaaggtg gtgcctgcag ccatttatca 
361 cccaacagca gactgacttt tgagcaggac atagagcgtg gcaatacaga tggtctgggg 
421 gactgtcaca attcctttgt ggcactgaat ggagtgattc gggaaagtta cctcaaaggc 
481 cacgaccagc tggttcccgt caccctcttg gccattgcag tcatcctggc tttcgtcatg 
541 ggggccgtct tctcgggcat caccgtctac tgcgtctgtg atcatcggcg caaagacgtg 
601 gctgtggtgc agcgcaagga gaaggagctc acccactcgc gccggggctc catgagcagc 
661 gtcaccaagc tcagcggcct ctttggggac actcaatcca aagacccaaa gccggaggcc 
721 atcctcacgc cactcatgca caacggcaag ctcgccactc ccggcaacac ggccaagatg 
781 ctcattaaag cagaccagca ccacctggac ctgacggccc tccccacccc agagtcaacc 
841 ccaacgctgc agcagaagcg gaagcccagc cgcggcagcc gcgagtggga gaggaaccag 
901 aacctcatca atgcctgcac aaaggacatg ccccccatgg gctcccctgt gattcccacg 
961 gacctgcccc tgcgggcctc ccccagccac atccccagcg tggtggtcct gcccatcacg 
1021 cagcagggct accagcatga gtacgtggac cagcccaaaa tgagcgaggt ggcccagatg 
1081 gcgctggagg accaggccgc cacactggag tataagacca tcaaggaaca tctcagcagc 
1141 aagagtccca accatggggt gaaccttgtg gagaacctgg acagcctgcc ccccaaagtt 
1201 ccacagcggg aggcctccct gggtcccccg ggagcctccc tgtctcagac cggtctaagc 
1261 aagcggctgg aaatgcacca ctcctcttcc tacggggttg actataagag gagctacccc 
1321 acgaactcgc tcacgagaag ccaccaggcc accactctca aaagaaacaa cactaactcc 
1381 tccaattcct ctcacctctc cagaaaccag agctttggca ggggagacaa cccgccgccc 
1441 gccccgcaga gggtggactc catccaggtg cacagctccc agccatctgg ccaggccgtg 
1501 actgtctcga ggcagcccag cctcaacgcc tacaactcac tgacaaggtc ggggctgaag 
1561 cgtacgccct cgctaaagcc ggacgtaccc cccaaaccat cctttgctcc cctttccaca 
1621 tccatgaagc ccaatgatgc gtgtacataa tcccaggggg agggggtcag gtgtcgaacc 
1681 agcaggcaag gcgaggtgcc cgctcagctc agcaaggttc tcaactgcct cgagtaccca 
1741 ccagaccaag aaggcctgcg gcagagccga ggacgctggg tcctcctctc tgggacacag 
1801 gggtactcac gaaaactggg ccgcgtggtt tggtgaaggt ttgcaacggc ggggactcac 
1861 cttcattctc ttccttcact ttcccccaca ccctacaaca ggtcggaccc acaaaagact 
1921 tcagttatca tcacaaacat gagccaaaag cacataccta ccccatcccc cacccccaca 
1981 cacacacaca catgcacaca acacatacac acacacgcac agaggtgaac agaaactgaa 
2041 acattttgtc cacaacctca cgggacgtgg ccagactggg tttgcgttcc aacctgcaaa 
2101 acacaaatac attttttaaa ate 
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LOCUS (LOC): BG729430 GenBank (R) 

GenBank ACC. NO. (GBN) : BG729430 

GenBank VERSION (VER) : BG729430.1 Gl: 14014505 

CAS REGISTRY NO. (RN) : 337152-08-6 

SEQUENCE LENGTH (SQL): 508 

MOLECULE TYPE (CI) : mRNA; linear 

DIVISION CODE (CI) : Expressed sequence tag 

DATE (DATE): 9 May 2001 

DEFINITION (def): fo84dl2.yl zebrafish gridded kidney Danio rerio cDNA 

clone 4726726 5' similar to TR:035464 035464 

***SEMAPHORIN*** *** VIA *** PRECURSOR. ;, mRNA 

sequence. 
source: zebrafish. 
organism (orgn): Danio rerio 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopterygii; Teleostei; 
Ostariophysi ; cypriniformes ; Cyprinidae; Danio 
NUCLEIC ACID COUNT (NA) : 106 a 152 c 143 g 107 t 
COMMENT: 

Contact: Stephen L. Johnson 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel : 314 286 1800 

Fax: 314 286 1810 

Email : zbraf ishOwatson.wustl .edu 

cDNA Library Preparation: Leonard ira Zon DNA sequencing by: 
Washington University Genome Sequencing Center clone distribution: 
Genome Systems, St. Louis, Missouri (web address: 
www.genomesystems.com) (email contact: info@genomesystems.com) and 
Research Genetics, Huntsville, Alabama (web address: www.resgen.com 
) (email contact: info@resgen.com) and 

RessourcenZentrumPrimarDatenbank, Berlin, Germany (web address: 
www. rzpd.de) 



Seq primer: T3 ET from Amersham 
High quality sequence stop: 480. 



REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO) : 

FEATURES (FEAT): 
Feature Key 



1 (bases 1 to 508) 
Clark, M.; Johnson, S.L. ; Lehrach.H.; Lee,R.; Li , F. ; 
Marra,M.; Eddy.S.y; Hillier,L.; Kucaba,T.; Martin, J.; 
Beck,C.; Wylie,T.; Underwood, K. ; Steptoe,M.; 
Theising,B.; Allen,M.; Bowers, Y. ; Person, B.; 
Swaller,T.; Gibbons, M.; Pape,D.; Harvey, N.; Schurk,R. 
Ritter,E.; Kohn,s.; shin,T.; Jackson, Y. ; Cardenas, M.; 
McCann,R.; wilson,R. 
washu Zebrafish EST Project 1998 
Unpublished (1998) 



Location 



Qualifier 



source 



1..508 



/organism="Danio rerio" 

/db-xref="taxon:7955" 

/clone="4726726" 

/clone-lib="zebrafish gridded 

kidney" 

/sex= M mixed" 

/tissue-type="kidney pooled from 
300 wild type adults" 
/lab-host= ,, XLOLR" 
/note= n 0rgan : kidney; Vector: 
pBK-CMV; Site-1: EcoRI; site-2: 
Xhol; oligo dT cDNA library 
constructed from mRNA pooled from 
pooled kidney tissue from 300 
adult zebrafish." 



SEQUENCE (SEQ): 
1 tttccgcgtg 
61 aaacagtgga 
121 cttctacttc 
181 ggtcatggcc 
241 tgacatgtca 
301 ctccacctgg 
361 cggctcgtcc 
421 cttcatcaaa 
481 gttcctcaag 



tggcgcgtgt gtgtaagagc gacagcggcg gctctcagcg 
catctttcct gaagacgcgc ctgaactgct ccatattctg 
aacatcctgc aggccgtgac cgacgtcata cacatcagcg 
accttctcca caccctacaa cagtattccg ggatcagctg 
gagatcgctc tggccttcac cggacgcttc aaagagcaga 
acacctgtgc ccgaggagag ggtgcccaaa cccaggccgg 
tccgtggaga agtttaaagt gtcgagtgag tttccggacg 
gcccacccgc tgatggagga ggccgtgccc gccatcgcca 
accatggtga ggtaccgg 



tgtgctggag 
gagactcaca 
gacgagacgt 
tgtgtgcgta 
aatcacctga 
gtgtgtgtgc 
attctctgag 
accggccctg 
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LOCUS (loc): 

GenBank acc. NO. (GBN) 

GenBank VERSION (VER) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



source: 

ORGANISM (ORGN) : 



BG642108 
BG642108 
BG642108.1 Gl: 13774034 
333222-70-1 
622 

mRNA; linear 
Expressed sequence tag 
23 Apr 2001 

pgllc.pk002.g3 Chicken liver cDNA library Gallus gall 
cDNA clone pgllc.pk002.g3 5' similar to gi | 9055334 
ref |NP_061214.1| sema domain, transmembrane domain 
(TM), and cytoplasmic domain, ( ***semaphorin*** 
***6A*** [mus musculus] sp| 035464 | SM6AJVIOUSE 
***SEMAPHORIN*** ***5 A *** PRECURSOR ( 

***SEMAPHORIN*** *** V IA*** ) (SEMA VIA) 

(SEMAPHORIN Q) (SEMA Q) gb | AAB86408 . 1 | , mRNA sequence 

chicken. 

Gallus gallus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Archosauria; Aves; Neognathae; 
Gal li formes; Phasianidae; Phasianinae; Gallus 
186 a 120 c 147 g 163 t 6 others 



) 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Contact: Larry A. Cogburn 

University of Delaware 

Townsend Hall, Newark, DE 19717, USA 

Tel: 302-831-1335 

Fax: 302-831-2822 

Email: cogburn@udel.edu, www.chickest.udel.edu. 
REFERENCE: 1 (bases 1 to 622) 

AUTHOR (AU) : Morgan, R.W.; Burnside,J . ; Cogburn, L. A 



TITLE (TI): 
JOURNAL (SO): 



Chicken liver ESTs (2001) 
Unpublished (2001) 



FEATURES (FEAT): 

Feature Key Location 

source 1..622 



Qualifier 



chicken liver cDNA 



/organism= M Gallus gall us" 
/db-xref="taxon:9031" 
/cl one=" pgl lc . pk002 . q3" 
/clone-liB= n <' " 
library" 
/sex="Male and Female" 
/tissue-type="liver" 
/lab-host= E.coli EMDH10B" 
/note="vector : pCMVSPORT 6" 

SEQUENCE (SEQ): 

1 gaggcttcct ctgctgtgcg cctctgtgat gctaatgagt ctgtcccagt gccgagctgt 
61 cagctttcct gaagatgagg accctattaa cattgttgac taccactatt caaggcaata 
121 tccagtattt agaggacgcc cttcaggcaa tgaatctcag cacagactgg acttccaact 
181 gatgctgaaa attcgagaca cactttatat cgctggcagg gatcaagttt acactgtaaa 
241 cttaaatgaa gttccaaaat cagaagttac tccaagcagg aaattaacat ggaggtcaag 
301 gcagcaggac agagagaact gtgctatgaa aggaaaacat aaagatgaat gccataactt 
361 cattaaagtc ttcgttccaa gaaatgacga gatggtgttt gtctgtggaa caaatgcatt 
421 taatcctatg tgcagatact atcggctgag tacgttagag tatgatgggg aggaaattag 
481 tggtttggca agatgcccat ttgatgccag acaaaccaat gtcgnnnnnn ttgctgatgg 
541 aaaattgtat tcagcaacag tagcagattt cctggcaagt gatgctgtta tttatcgcag 
601 catgggagat ggatctgccc ta 



L4 ANSWER 66 OF 86 

LOCUS (LOC): 

GenBank ACC. NO. (gbn) : 

GenBank VERSION (VER) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



SOURCE: 
ORGANISM (ORGN); 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 
Gl:13565232 



BG553452 
BG553452 
BG553452.1 
330900-90-8 
516 

mRNA; linear 
Expressed sequence tag 
9 Apr 2001 

dab84d05.yl NICHD XGC Emb4 Xenopus laevis cDNA clone 
IMAGE:4203704 5' similar to TR:035464 035464 



*VIA* 



PRECURSOR. 



mRNA 



***SEMAPHORIN*** 
sequence. 

African clawed frog. 
Xenopus laevis 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Amphibia; Batrachia; Anura; 
Mesobatrachia; Pipoidea; Pipidae; Xenopodinae; Xenopus 
137 a 112 c 115 g 152 t 



Ph.D. 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-rOmail .nih.gov 

Tissue Procurement: Martha Rebbert, Steven L. Klein, 
cDNA Library Preparation: Life Technologies, inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA sequencing by: Washington University Genome Sequencing Center 
Clone distribution: Xenopus clones from this library are available 
through the I.M.A.G.E. consortium/LLNL at: info@image.llnl.gov 
Seq primer: -40RP from Gibco 
High quality sequence stop: 386. 
REFERENCE: 1 (bases 1 to 516) 

author (au): nci-cgap http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, cancer Genome Anatomy 

Project (CGAP) , Tumor Gene Index 
JOURNAL (so): unpublished (1997) 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



1. .516 



/organism="Xenopus laevis" 

/db-xref="taxon:8355" 

/cl one=" IMAGE : 4203704" 

/clone-! ib="NICHD XGC Emb4" 

/dev-stage="embryo, stage 31-32" 

/lab-host="DHl0B 

(phage- res i stant) " 



/note="organ: whole embryo; 
Vector: PCMV-SP0RT6; site-1: Notl; 
site-2: Sail; cloned 
unidirectionally. Primer: oligo 
dT. Average insert size 2.1 kb. 
Constructed by Life Technologies. 
Note: This is a Xenopus Gene 
collection (XGC ) library." 



SEQUENCE (SEQ): 

1 gacccacgcg 
61 tccctgaacg 
121 gcagtcgtga 
181 acggcaggag 
241 cttttggcca 
301 gtatactgta 
361 gatatgagtc 
421 agtgacaatc 
481 gggaaaattg 



tccgactcct 
tctgactctg 
gctgctcaga 
tgattcgtga 
ttgctgttat 
tctgtgacca 
gttcacgcag 
agtccaaaga 
ctacacctaa 



tagaggtaaa 
gagccttctc 
actcatgtaa 
aagttatctc 
tcttgctttt 
tcgtcgaagg 
aggctctatg 
gtcaaagcca 
tacaactgct 



tgcagccaac 
agtttgctta 
tagaagcatt 
aaaggtcaca 
gtcatgggtg 
gatattgatg 
gcaagtgtga 
gaggctatcc 
aaaatg 



tctcttgttt 
atgaggttta 
gtcttattaa 
atcaacttgt 
cagtcttctc 
ttgtccaaag 
ccaaactaag 
ttgcgcctct 



tatcttgacg 
taaattagcc 
gcacacgtga 
tcctgttacc 
tggaattatt 
gaaggacaaa 
tggactcttc 
gatgcataat 



L4 ANSWER 67 OF 86 



LOCUS (LOC) : 
GenBank ACC. NO. (GBN) : 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN): 



AK004390 
AK004390 

AK004390.1 Gl: 12835554 
322059-94-9 
1013 

mRNA ; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 18-day embryo whole body cDNA, RIKEN 
full-length enriched library, clone : 1110067B02 
product : weakly similar to ***semaphorin*** 

*** 6A *** PRECURSOR ( ***SEMAPHORIN*** *** V IA 
) (SEMA VIA) (SEMAPHORIN Q) (SEMA Q) [MUS musculus], 
full insert sequence. 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

Please visit our web site (http://genome.gsc.riken.jp/) for further 
details . 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. First strand cDNA was primed with a primer 
[5' GAGAGAGAGAGCGGCCGCAACTCGAGTTTTTTTTTTTTTTTTVN 3'L CDNA was 
prepared by using trehalose thermo-activated reverse transcriptase 
and subsequently enriched for full-length by cap-trapper. Second 
strand cDNA was prepared with the primer adapter of sequence[5' 
GAGAGAGAGAAGGATCCAAGAGCTCAATTAATTTAATTAAACCCCCCCCCCC 3 ' ] . CDNA Was 
cleaved with xhol and Sstl. cloning sites, 5' end: Ssti; 3 1 end: 
xhol. Host: SOLR. 

1 

Carninci,P.; Hayashizaki , Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

CarnincijP.; shibata,Y.; Hayatsu,N.; Sugahara,Y.; 
shibata,K.; itoh,M.; Konno,H.; okazaki.Y.; 
Muramatsu.M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 
cDNAs to prepare full-length cDNA libraries for rapid 
discovery of new genes 
Genome Res . , 10 (10) , 1617-1630 (2000) 
CA 134:305920 
3 

Shibata.K.; ltoh,M.; Aizawa,K.; Nagaoka.s.; Sasaki ,N. 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO) : 
OTHER SOURCE (OS) 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



Carmnci ,p. 
Kitsunai j. 
Nakamura,s. 



Konno.H.; Akiyama.J.; Nishi,K.; 
Tashiro,H.; Itoh f M.; Sumi,N.; lshii,Y. 
Hazama,M. ; Nishine f T.; Harada,A.; 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO) 



Yamamoto , R . ; Matsumoto,H. ; Sakaguchi ,s. ; Ikegami ,T. ; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
ishikawa.T.; Ozawa,K.; Tanaka,T.; Matsuura,S.; 
Kawai.J.; okazaki,Y.; Muramatsu ,M. ; lnoue,Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system--384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

CA 134:203311 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 1013) 

Adachi,J.; Aizawa,K.; Akahira,s.; Akimura,T.; Arai,A.; 
Aono,H.; Arakawa,T.; Bono,H.; Carninci,P.; Fukuda,s.; 
Fukunishi , Y. ; Furuno,M.; Hanagaki,T.; Hara,A. ; 
Hayatsu,N.; Hiramoto,K.; Hiraoka,T.; Hori,F.; 
lmotani,K.; lshii,Y.; ltoh,M.; lzawa,M.; Kasukawa,T.; 
Kato,H.; Kawai,J.; Kojima,Y.; Konno,H.; Kouda,M.; 
Koya,S.; Kurihara,C; Matsuyama,T, ; Miyazaki,A.; 
Nishi.K.; Nomura.K.; Numazaki.R.; Ohno,M.; okazaki,Y.; 
Okido.T.; Owa,C; Saito f H.; Saito,R.; Sakai,C; 
Sakai,K.; Sano,H.; Sasaki, D.; shibata,K.; Shibata,Y.; 
shinagawa,A. ; shiraki,T.; Sogabe,Y.; Suzuki, H.; 
Tagami,M.; Tagawa,A.; Takahashi , F, ; Tanaka,T.; 
Tejima,Y.; Toya,T.; Yamamura,T. ; Yasunishi ,A. ; 
Yoshida,K.; Yoshino,M.; Muramatsu, M. ; Hayashizaki ,Y. 
Direct Submission 

Submitted (10-JUL-2000) Yoshihide Hayashizaki, The 
Institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . j p/ , 
Tel =81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT) 
Feature Key 



Location 



Qua! ifier 



source 



1..1013 



misc-feature 



1. .1013 



/orqanism="Mus musculus" 
/moT-type= ,, mRNA ,, 
/strain="C57BL/6J n 
/db-xref =" FANTOM-DB : 1110067B02 " 
/db-xref ="taxon : 10090" 
/clone= M 1110067B02 n 
/tissue-type="whole body" 
/clone-! ib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="18-day embryo" 
/note="weakly similar to 
SEMAPHORIN 6A PRECURSOR 
(SEMAPHORIN VIA) (SEMA VIA) 
(SEMAPHORIN Q) (SEMA Q) [MUS 
musculus] (SWISSPROT 1 035464, 
evidence: FASTY, 60.6%ID, 
64.8%length, match=1719)" 



SEQUENCE (SEQ): 

1 gcagccagca cctgaggact tcttatacag tcataaagcc acctagagcg atttgttgtt 
61 gagcaacccg ccccttccct ccaggatcag aagcacacca cccttgccat gataaacatt 
121 ccatcccctg ctgttctgat aatgtgaagt cagatgaggg ttcccaggtt atgtggcacc 
181 tgcggaaacc atgtctagag tcgtttctat gtagcttggc agagcccact gatggctgca 



241 ggtgtgtagc 
301 acttactcta 
361 gtgatatctt 
421 accagatcat 
481 cattcttttc 
541 cttcagcttt 
601 gcactacaca 
661 cgtggtaact 
721 actgtgagtt 
781 gttaatttaa 
841 tttaatggta 
901 agtactttaa 
961 actttggggt 



ctactgacag 
gaaatgtcca 
gggacagtga 
ttttatacag 
tatagaaaaa 
gcaactaaaa 
tctgccacca 
ctgtgtgacc 
ctatcagtat 
ttttaaatgc 
cggcataaaa 
cgtgtataga 
tttaacaccg 



ctttggttaa 
cagctaagga 
ccatggtctg 
gatgtagagt 
tgattccctg 
atattacagt 
acaatgttgt 
ccgataacat 
ttatgttgaa 
tttatccatt 
aaataaccct 
tatcttgtaa 
catgtaaagt 



cccactgcac 
aggctcctga 
cattacttgg 
actgatgcgg 
tgatcttctg 
ttaataatca 
ttgcatttac 
tttgtaacat 
atttctaaca 
tgtgcaaagg 
aaagtgaagt 
acttgtattg 
caaaataaaa 



atcacccagt 
gccagtcagg 
gggaggatgg 
ttgatctttt 
gaagctccaa 
attaaaccaa 
cttaccaata 
tgtgctgcct 
tggattctag 
taaacacaga 
ggctctatac 
tggatgtgta 
tatccaagtc 



catttaccca 
cggagttcaa 
ggtacaagac 
ccttcaagaa 
agctgaaacc 
ccaacaataa 
ttaatcccag 
tagagtttgt 
tctctattct 
ttgtatcttt 
tgttttatag 
aataatatgt 
att 
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LOCUS (loc): 
GenBank acc. NO. (GBN) 
GenBank version (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN) : 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 



JOURNAL (SO): 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (r) 



AF288666 
AF288666 
AF288666 .1 GI : 11093908 
301990-09-0 
3018 

mRNA ; linear 

Rodents 

11 Dec 2000 

Mus musculus axon guidance signal SEMA6A1 mRNA, 
complete cds. 
house mouse. 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
: 774 a 859 c 744 g 641 t 

1 (bases 1 to 3018) 

Klostermann, A. ; Lutz,B.; Gertler,F.; Behl,c. 

The orthologous human and murine ***semaphorin*** 

***6a*** - ***]*** proteins (SEMA6A-l/Sema6A-l) 
bind to the enabled/vasodilator-stimulated 
phosphoprotein-like protein (EVL) via a novel 
carboxyl -terminal zyxin-like domain 
3 . Biol. Chem., 275 (50), 39647-39653 (2000) 
CA 134:189679 

2 (bases 1 to 3018) 
Klostermann, A, ; Behl ,c. 
Direct Submission 

Submitted (21-JUL-2000) Independent Research Group 
Neurodegeneration, MPI of Psychiatry, Kraepelinstrasse 
2-10, Munich 80804, Germany 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 1..3018 /organism= M Mus musculus" 

/db-xref="taxon : 10090" 
/ti ssue-type="brai n" 

CDS 1 . . 3018 /codon-start=l 

/product="axon guidance signal 
SEMA6A1" 

/protei n-i d="AAG29494 . 1" 

/db-xref="Gl : 11093909" 

/transl ati on="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFNPSCRNYRVDTLETFGDEFS 

GMARCPYDAKHVNIALFADGKLYSATVTDFLAID 

AVIYRSPGDSPTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTS F 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

VLATFSTPYNSIPGSAVCAYDMLD 

IADVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTHPLMDEAVPSIINRPWFLRTMVRYRLTKIAVD 

NAAGPYQNHTWFLEEMNVYNPEK 

CSYDGVEDKRIMGMQLDRASGS LYVAFSTCVIKV 



PLGRCERHGKCKKTCIASRDPYCG 

WVRESGSCAHLSPLSRLTFEQDIERGNTDGLGDC 

HNS FVALNGHASS LYPNTTTSDSA 

SRDGYESRGGMLDWNDLLEAPGSTDPLGAVSSHN 

HQDKKGVIRESYLKSNDQLVPVTL 

LAIAVI LAFVMGAVFSGI IVYCVCDH RRKDVAW 

RRKEKELTHSRRGSMSSVSELSGL 

FGDTQS KD PK P EAI LT P LMH NGK L ATS S NTAKM L 

IKADQHHLDLTALPTPESTPTLQQ 

KRKPNRGSREWERNQNIINACTKDMPPMGSPVIP 

TDLPLRASPSHIPSVWLPITQQG 

YQHEYVDQPKMSEWAQMALEDQAATLEYKTTKE 

HLSSKSPNHGVNLVENLDSLPPKV 

PQREASLGPPGTSLSQTGLSKRLEMQHSSSYGLE 

YKRSYPTNSLTRSHQTTTLKRNNT 

NSSNSSHLSRNQSFGRGDNPPPAPQRVDSIQVHS 

SQPSGQAVTVSRQPSLNAYNSLTR 

SGLKRTPSLKPDVPPKPS FAPLSTSMKPNDACT" 



SEQUENCE (SEQ) : 

1 atgcggccag 
61 ccagaagatt 
121 tttgtgggcc 
181 atcatgatca 
241 atagacacat 
301 caggctgacg 
361 attaaagttc 
421 aacccttcct 
481 ggaatggcca 
541 aaactctact 
601 cccggagaca 
661 tactttgtcc 
721 gtagaataca 
781 gacatgggag 
841 ctgaactgct 
901 gatgtgattc 
961 agcatcccag 
1021 gggaggttca 
1081 gtccctaagc 
1141 tccaatgagt 
1201 gcagtgcctt 
1261 accaaaattg 
1321 gaggagatga 
1381 atcatgggca 
1441 gtgatcaagg 
1501 gcctccagag 
1561 ccccttagca 
1621 gactgtcaca 
1681 actacgtcag 
1741 aacgacctgc 
1801 caccaggaca 
1861 cctgtcaccc 
1921 ggcatcatcg 
1981 aaggagaaag 
2041 ggcctctttg 
2101 atgcacaacg 
2161 cagcatcacc 
2221 aaacggaaac 
2281 tgcaccaagg 
2341 gcctccccaa 
2401 cacgagtacg 
2461 caggctgcca 
2521 catggggtga 
2581 gcctccctag 
2641 atgcaacact 
2701 acaagaagcc 
2761 cacctctcca 
2821 gtggactcta 
2881 cagcccagcc 
2941 ctaaagcctg 
3001 aatgatgcgt 



cagccttact 
ccgagccaat 
acaagccagg 
tgaacagaac 
cccacacaga 
tagacacatg 
ttctcaagaa 
gcagaaacta 
gatgccctta 
cggctacagt 
gccctaccct 
aagccgtgga 
acactatggg 
ggtctcagag 
cggtgcctgg 
gcattaatgg 
gttctgcagt 
aggaacagaa 
ccaggccagg 
ttcccgatga 
ccatcatcaa 
cagtagacaa 
atgtttacaa 
tgcagctcga 
tgcctcttgg 
acccgtattg 
gactgacatt 
attccttcgt 
attcggcatc 
tcgaggcacc 
agaagggagt 
tcctggccat 
tgtattgtgt 
agctcactca 
gggacaccca 
gcaagctggc 
tagacctcac 
ccaaccgcgg 
acatgcctcc 
gccacatccc 
tagatcagcc 
ccctggagta 
accttgtgga 
gtcccccggg 
cctcctccta 
accagaccac 
ggaaccagag 
tccaggtgca 
tcaatgccta 
atgtaccccc 
gtacataa 



gctgtgtctc 
cagtatttcg 
acggaacacc 
cctctacgtt 
agaaatttac 
caggatgaag 
gaatgatgat 
cagggtcgat 
tgatgccaaa 
gactgacttt 
caggactgtc 
ttatggggac 
aaaggttgtt 
agtcctggag 
agactctcat 
ccgtgatgtt 
ctgtgcctat 
atcacctgac 
ctgttgtgct 
taccctgaac 
cagaccttgg 
cgctgccggg 
cccagaaaag 
cagagcgagt 
ccgctgtgag 
tgggtgggta 
tgagcaggac 
ggcactgaat 
ccgagacggg 
tggcagcaca 
gattcgggaa 
tgcagtcatt 
gtgcgatcac 
ctcgcgtcgg 
gtccaaggac 
cacgtctagc 
cgccctgccc 
cagtcgcgag 
catgggttcc 
cagcgtggtg 
caaaatgagc 
taagaccacc 
gaacctggac 
aacctcactg 
tgggctcgaa 
cactctcaaa 
ctttggccgg 
cagctcccag 
caactcactg 
caaaccttcc 



acactgctac 
catggcaact 
acgcagaggc 
gctgctcgag 
tgtagcaaaa 
gggaaacata 
acgctgtttg 
accttggaaa 
catgtcaaca 
ctggccattg 
aagcatgatt 
tatatctact 
ttccctaggg 
aagcagtgga 
ttttatttca 
gtcttggcaa 
gacatgcttg 
tctacctgga 
ggatcatcct 
ttcattaaga 
ttcctgagaa 
ccatatcaga 
tgcagctatg 
ggctcactct 
cgacatggga 
agggaaagtg 
attgagcgtg 
gggcacgcca 
tatgagtcta 
gaccctttgg 
agttacctca 
ctggcttttg 
cggcgcaaag 
ggatctatga 
ccaaagcctg 
aacaccgcca 
accccagagt 
tgggagagga 
cctgtgattc 
gtcctgccca 
gaggtggtgg 
aaagagcacc 
agcctgcccc 
tcacagaccg 
tataagagga 
agaaacaata 
ggagacaacc 
ccctctggcc 
acgaggtcgg 
tttgctcccc 



actgcgccgg 
atacaaaaca 
acaggctgga 
accatattta 
aactgacatg 
aggatgaatg 
tctgtggaac 
cttttgggga 
tcgctctgtt 
atgcggtcat 
caaagtggtt 
tcttcttcag 
tggctcaggt 
catctttcct 
atatactcca 
ccttttccac 
acattgctga 
cacccgttcc 
ctttagaaaa 
cgcatccact 
caatggtcag 
atcacactgt 
atggtgtaga 
atgttgcatt 
agtgtaaaaa 
gttcctgtgc 
gcaatacgga 
gttccctcta 
ggggaggcat 
gggcagtgtc 
aaagcaacga 
tcatgggggc 
acgtggcagt 
gcagtgtctc 
aggccatcct 
agatgctcat 
ccaccccgac 
accagaacat 
ccacggacct 
tcacgcagca 
ctcagatggc 
tgagtagtaa 
ctaaagttcc 
gcctgagcaa 
gctaccccac 
ctaactcctc 



ggcgggtttc 
gtatccggtg 
catccagatg 
tactgttgat 
gaaatctaga 
tcacaacttc 
caatgccttc 
tgaatttagc 
tgcagatgga 
ttacaggagc 
gaaagagccg 
agaaattgca 
ctgtaagaat 
gaaggctcgc 
ggcagttaca 
accttataac 
tgttttcact 
agacgaacga 
atatgcaacc 
catggacgag 
ataccgcctg 
ggttttcctg 
agacaaaagg 
ctctacttgt 
aacctgcatc 
ccatctgtca 
cggcctagga 
tcccaacacc 
gctggactgg 
ctctcataac 
ccagcttgtt 
cgtcttctcg 
agtgcggcgc 
cgagctcagt 
cacaccactc 
caaggctgac 
actgcagcag 
catcaatgcc 
gcccctccgg 
gggctaccag 
actggaggac 
gagtcccaac 
acagcgcgag 
gaggctggag 
gaactcgctc 
caattcctcc 



caccccccgc cccgcagcgg 
aggccgtgac tgtttcgagg 
ggctgaagcg caccccctcg 
tttccacatc catgaagccc 
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GEN BANK . RTM . COPYRIGHT 2004 on STN 



LOCUS (LOC) : 
GenBank ACC. no. 



AF279656 
(GBN): AF279656 



GenBank (R) 



GenBank version (ver) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 
SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO) : 



AF279656.1 Gl:11093650 

386088-74-0 

3862 

mRNA; linear 
Primates 
11 Dec 2000 

Homo sapiens semaphorin SEMA6A1 mRNA, complete cds. 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

: 971 a 1111 c 967 g 813 t 

1 (bases 1 to 3862) 

Klostermann,A. ; Lutz,B.; Gertler,F.; Behl ,C. 

The orthologous human and murine ***semaphorin*** 

*ft* 6A *** _ ***!*** proteins (SEMA6A-l/Sema6A-l) 
bind to the enabled/vasodilator-stimulated 
phosphoprotein-like protein (EVL) via a novel 
carboxyl -terminal zyxin-like domain 
J. Biol. Chem., 275 (50), 39647-39653 (2000) 
CA 134:189679 

2 (bases 1 to 3862) 

Klostermann,A. ; Lutz,B.; Gertler,F.; Behl,C. 
Direct Submission 

Submitted (19-JUN-2000) Independent Research Group 
Neurodegeneration, MPI of Psychiatry, Kraepelinstrasse 
2-10, Munich 80804, Germany 



FEATURES (FEAT) : 
Feature Key 



Location 



Qualifier 



source 



CDS 



1. .3862 



658. .3750 



sapiens 
:9606" 



/organism="Homo 
/db-xref="taxon: 
/chromosome="5" 
/map= M 5q21-22" 
/note= M guidance signal" 
/codon-start=l 

/product="semaphorin SEMA6A1" 

/protei n-i d="AAG29378 . 1" 

/db-xref="Gl: 11093651" 

/translation="MRSEALLLYFTLLHFAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 

TNAFNPSCRNYKMDTLEPFGDEFS 

GM ARC P YDAKH AN VAL FADGK LYSATVTD F LAI D 

AVIYRSLGESPTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQWTSF 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

VLATFSTPYNSIPGSAVCAYDMLD 

IASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLERYATSNEFPDDTLNFI 

KTH PLMDEAVPS I FN R PWFLRTMVRYR LTKI AVD 

TAAGPYQNHTWFLGSEKGIILKF 

LARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 

KRIMGMQLDRASSSLYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGAC 

SHLSPNSRLTFEQDIERGNTDGLG 

DCHNS FVALNGHSSS LLPSTTTSDSTAQEGYES R 

GGMLDWKHLLDSPDSTDPLGAVSS 

HNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILA 

FVMGAVFSGITVYCVCDHRRKDVA 

WQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKD 

PKPEAILTPLMHNGKLATPGNTAK 

MLIKADQHHLDLTALPTPESTPTLQQKRKPSRGS 

REWERNQN LINACTKDMPPMGS PV 

I PTD L PL RAS PS H I PS VWL PITQQGYQH EYVDQ 

PKMSEVAQMALEDQAATLEYKTIK 

EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGP 

PGASLSQTGLSKRLEMHHSSSYGV 

DYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 

RNQS FGRGDN P P PA PQRVDS IQVH 

S SQPSGQAVTVS RQPS LNAYNS LTRSGLKRTPS L 



KPDVPPKPSFAPLSTSMKPNDACT " 



SEQUENCE (SEQ) : 

1 ggcacgaggc tgcagccaac tccgctcccc gcgcactcgg ctgcccaggc gctcggaacc 
61 cagcagcggc gctcctccgc ggtgccggtc gcccgcgatg cccgcttagc agcgtgtagc 
121 agcggccagc atcaccacac ccgcggcacc gcgctgccgg ccgcagagcc gggccagagc 
181 cttgcccccc tcccccagcc cccaccccgc cccccgccct gaaatgactt gttaatcggc 
241 gcagacacca ccaaggggac tcaccgaagt ggaatccaag tggaatttgg atttggagaa 
301 gagtttcttg aacatttacc ctcttccttg ttggttttct ttttcttttt cttctttttt 
361 tttttggctt cttttttcct ctccccttct ccgctcgtca ttggagatga acacatcgcg 
421 tttgcatccc agaaagtagt cgccgcgact atttccccca aagagacaag cacacatgta 
481 ggaatgacaa aggcttgcga aggagagagc cgcagccgcg gcccggagag atcccctcga 
541 taatggatta ctaaatggga tacacgctgt accagttcgc tccgagcccc ggccgcctgt 
601 ccgtcgatgc accgaaaagg gtgaagtaga gaaataaagt ctccccgctg aactactatg 
661 aggtcagaag ccttgctgct atatttcaca ctgctacact ttgctggggc tggtttccca 
721 gaagattctg agccaatcag tatttcgcat ggcaactata caaaacagta tccggtgttt 
781 gtgggccaca agccaggacg gaacaccaca cagaggcaca ggctggacat ccagatgatt 
841 atgatcatga acggaaccct ctacattgct gctagggacc atatttatac tgttgatata 
901 gacacatcac acacggaaga aatttattgt agcaaaaaac tgacatggaa atctagacag 
961 gccgatgtag acacatgcag aatgaaggga aaacataagg atgagtgcca caactttatt 
1021 aaagttcttc taaagaaaaa cgatgatgca ttgtttgtct gtggaactaa tgccttcaac 
1081 ccttcctgca gaaactataa gatggataca ttggaaccat tcggggatga attcagcgga 
1141 atggccagat gcccatatga tgccaaacat gccaacgttg cactgtttgc agatggaaaa 
1201 ctatactcag ccacagtgac tgacttcctt gccattgacg cagtcattta ccggagtctt 
1261 ggagaaagcc ctaccctgcg gaccgtcaag cacgattcaa aatggttgaa agaaccatac 
1321 tttgttcaag ccgtggatta cggagattat atctacttct tcttcaggga aatagcagtg 
1381 gagtataaca ccatgggaaa ggtagttttc ccaagagtgg ctcaggtttg taagaatgat 
1441 atgggaggat ctcaaagagt cctggagaaa cagtggacgt cgttcctgaa ggcgcgcttg 
1501 aactgctcag ttcctggaga ctctcatttt tatttcaaca ttctccaggc agttacagat 
1561 gtgattcgta tcaacgggcg tgatgttgtc ctggcaacgt tttctacacc ttataacagc 
1621 atccctgggt ctgcagtctg tgcctatgac atgcttgaca ttgccagtgt ttttactggg 
1681 agattcaagg aacagaagtc tcctgattcc acctggacac cagttcctga tgaacgagtt 
1741 cctaagccca ggccaggttg ctgtgctggc tcatcctcct tagaaagata tgcaacctcc 
1801 aatgagttcc ctgatgatac cctgaacttc atcaagacgc acccgctcat ggatgaggca 
1861 gtgccctcca tcttcaacag gccatggttc ctgagaacaa tggtcagata ccgccttacc 
1921 aaaattgcag tggacacagc tgctgggcca tatcagaatc acactgtggt ttttctggga 
1981 tcagagaagg gaatcatctt gaagtttttg gccagaatag gaaatagtgg ttttctaaat 
2041 gacagccttt tcctggagga gatgagtgtt tacaactctg aaaaatgcag ctatgatgga 
2101 gtcgaagaca aaaggatcat gggcatgcag ctggacagag caagcagctc tctgtatgtt 
2161 gcgttctcta cctgtgtgat aaaggttccc cttggccggt gtgaacgaca tgggaagtgt 
2221 aaaaaaacct gtattgcctc cagagaccca tattgtggat ggataaagga aggtggtgcc 
2281 tgcagccatt tatcacccaa cagcagactg acttttgagc aggacataga gcgtggcaat 
2341 acagatggtc tgggggactg tcacaattcc tttgtggcac tgaatgggca ttccagttcc 
2401 ctcttgccca gcacaaccac atcagattcg acggctcaag aggggtatga gtctagggga 
2461 ggaatgctgg actggaagca tctgcttgac tcacctgaca gcacagaccc tttgggggca 
2521 gtgtcttccc ataatcacca agacaagaag ggagtgattc gggaaagtta cctcaaaggc 
2581 cacgaccagc tggttcccgt caccctcttg gccattgcag tcatcctggc tttcgtcatg 
2641 ggggccgtct tctcgggcat caccgtctac tgcgtctgtg atcatcggcg caaagacgtg 
2701 gctgtggtgc agcgcaagga gaaggagctc acccactcgc gccggggctc catgagcagc 
2761 gtcaccaagc tcagcggcct ctttggggac actcaatcca aagacccaaa gccggaggcc 
2821 atcctcacgc cactcatgca caacggcaag ctcgccactc ccggcaacac ggccaagatg 
2881 ctcattaaag cagaccagca ccacctggac ctgacggccc tccccacccc agagtcaacc 
2941 ccaacgctgc agcagaagcg gaagcccagc cgcggcagcc gcgagtggga gaggaaccag 
3001 aacctcatca atgcctgcac aaaggacatg ccccccatgg gctcccctgt gattcccacg 
3061 gacctgcccc tgcgggcctc ccccagccac atccccagcg tggtggtcct gcccatcacg 
3121 cagcagggct accagcatga gtacgtggac cagcccaaaa tgagcgaggt ggcccagatg 
3181 gcgctggagg accaggccgc cacactggag tataagacca tcaaggaaca tctcagcagc 
3241 aagagtccca accatggggt gaaccttgtg gagaacctgg acagcctgcc ccccaaagtt 
3301 ccacagcggg aggcctccct gggtcccccg ggagcctccc tgtctcagac cggtctaagc 
3361 aagcggctgg aaatgcacca ctcctcttcc tacggggttg actataagag gagctacccc 
3421 acgaactcgc tcacgagaag ccaccaggcc accactctca aaagaaacaa cactaactcc 
3481 tccaattcct ctcacctctc cagaaaccag agctttggca ggggagacaa cccgccgccc 
3541 gccccgcaga gggtggactc catccaggtg cacagctccc agccatctgg ccaggccgtg 
3601 actgtctcga ggcagcccag cctcaacgcc tacaactcac tgacaaggtc ggggctgaag 
3661 cgtacgccct cgctaaagcc ggacgtaccc cccaaaccat cctttgctcc cctttccaca 
3721 tccatgaagc ccaatgatgc gtgtacataa tcccaggggg agggggtcag gtgtcgaacc 
3781 agcaggcaag gcgaggtgcc cgctcagctc agcaaggttc tcaactgcct cgagtaccca 
3841 ccagaccaag aaggcctgcg gc 

L4 ANSWER 70 OF 86 GENBANK.RTM. COPYRIGHT 2004 on STN 

LOCUS (LOC): BF015492 GenBank (R) 

GenBank ACC. NO. (GBN) : BF015492 

GenBank VERSION (VER) : BF015492.1 GI: 10746824 

CAS REGISTRY NO. (RN) : 297040-29-0 

SEQUENCE LENGTH (SQL) : 403 



MOLECULE TYPE (Cl) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



SOURCE : 
ORGANISM (ORGN): 



mRNA; linear 
Expressed sequence tag 
29 Dec 2000 

uy22c05.yl NCl_CGAP_Lu30 Mus musculus cdna clone 
IMAGE =3660296 5' similar to TR:035464 035464 

***SEMAPHORIN*** *** V IA*** PRECURSOR. mRNA 

sequence, 
house mouse. 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
NUCLEIC ACID COUNT (NA) : 117 a 90 c 97 g 99 t 
COMMENT : 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-rOmail .nih.gov 

Tissue Procurement: Gilbert smith, Ph.D. 
cDNA Library Preparation: Life Technologies, inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Washington University Genome Sequencing center 
clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

i mage . 11 nl . gov/i mage/html /i resou rces . shtml 

MGI: 1421064 

Trace considered overall poor quality 
Seq primer: -40RP from Gibco 
High quality sequence stop: 1. 
REFERENCE: 1 (bases 1 to 403) 

author (au) : nci-cgap http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (Tl) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP) , Tumor Gene Index 
journal (so): unpublished (1997) 



FEATURES (FEAT) 
Feature Key 



Location 



Qualifier 



source 1..403 /organism="Mus musculus" 

/strain="CZECH ir" 
/db-xref ="taxon : 10090" 
/clone="lMAGE: 3660296" 
/cl one-1 i b= n NCI-CGAP-Lu30" 
/tissue-type="tumor , metastatic to 
mammary" 

/lab-host="DHl0B M 
/note="0rgan : lung; Vector: 
PCMV-SPORT6; site-1: Noti; site-2: 
Sail; transgenic model WNT-1, 
expression driven by MMTV-LTR 
enhancer; cloned unidi rectionally . 

Primer: oliqo dT. Library 
constructed By Life Technologies, 
investigator providing samples: 
Gilbert Smith, NIH" 

SEQUENCE (SEQ): 

1 tttagaaaaa tatgcaacct ccaatgagtt tcccgatgat accctgaact taattaagac 
61 gcatccactc atggacgagg cagtgccttc catcatcaac agaccttggt tcctgagaac 
121 aatggtgaga taccgcctga ccaaaattgc agtagacaac gctgccgggc catatcagaa 
181 tcacactgtg gttttcctgg gatcagaaaa gggaatcatc ctgaagttct tggccaggat 
241 aggaagcagt ggtttcctaa atggcagcct tttcctggag gagatgaatg tttacaaccc 
301 agaaaagtgc agctatgatg gcgtagaaga caaaaggatc atgggcatgc agctcgacag 
361 agcgagtggc tcactctatg ttgcattctc tacttgtgtg att 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



GenBank (R) 
Gl: 10432889 



AK021660 
AK021660 
AK02 1660.1 
390309-92-9 
1588 

mRNA; linear 
Primates 
30 Jan 2004 

Homo sapiens cDNA FLJ11598 fis, clone HEMBA1003866, 
moderately similar to Mus musculus ***semaphorin*** 
***via*** mRNA. 



(ST) 



KEYWORDS 
SOURCE: 
ORGANISM (ORGN) : 



REFERENCE: 

AUTHOR (AU); 



oli go capping; fis (full insert sequence) 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

COMMENT: 

NEDO human cDNA sequencing project supported by Ministry of 
international Trade and Industry of Japan; cDNA full insert 
sequencing: Research Association for Biotechnology; cDNA library 
construction, 5'- & 3'-end one pass sequencing and clone selection: 
Helix Research Institute (supported by Japan Key Technology center 
etc.) and Department of virology, institute of Medical Science, 
University of Tokyo. 

1 

ota,T.; Suzuki, Y.; Nishikawa,T. ; otsuki,T.; 
sugiyama,T.; lrie,R.; Wakamatsu,A. ; Hayashi,K.; 
Sato,H.; NagaijK.; Kimura,K.; Makita,H.; Sekine,M.; 
obayashi,M.; Nishi,T.; Shibahara,T. ; Tanaka,T. ; 
lshii,s.; Yamamoto,J.; saito,K.; Kawai,Y.; lsono,Y.; 
Nakamura,Y.; Nagahari,K.; Murakami, K.; Yasuda,T.; 
iwayanagi ,T. ; Wagatsuma,M. ; shi ratori ,A. ; Sudo,H.; 
Hosoiri,T.; Kaku,Y.; Kodaira,H.; Kondo,H.; Sugawara,M.; 
Takahashi ,M. ; Kanda,K.; Yokoi,T.; Furuya.T.; 
Kikkawa,E.; omura,Y. ; Abe,K.; Kamihara.K. ; Katsuta,N.; 
sato,K.; Tanikawa,M.; Yamazaki,M.; Ninomiya,K.; 
Ishibashi ,T. ; Yamashita,H. ; Murakawa.K.; Fujimori , K. ; 
Tanai,H.; Kimata,M.; Watanabe,M.; Hiraoka,s.; chiba,Y.; 
lshida,s.; Ono,Y.; Takiguchi ,S. ; Watanabe,s 
Yosida,M.; Hotuta,T.; Kusano,J.; Kanehori,K 
Takahashi -Fuji i , A. ; Hara,H.; Tanase,T.; Nomura,Y.; 
Togiya,s.; Komai,F.; Hara,R.; Takeuchi,K.; Arita,M.; 
lmose,N.; Musashino,K. ; Yuuki,H.; oshima,A. ; Sasaki, N. 
Aotsuka,S.; Yoshikawa, Y. ; Matsunawa,H. ; lchihara,T.; 
Shiohata,N.; Sano,s.; Moriya,s.; Momiyama,H.; satoh,N. 
Takami,S.; Terashima,Y. ; Suzuki ,0.; Nakagawa,S.; 
Senoh.A.; Mizoguchi ,H. ; Goto.Y.; shimizu,F.; Wakebe,H. 
Hishigaki ,H. ; Watanabe,T.; Sugiyama,A. ; Takemoto,M.; 
Kawakami,B.; Yamazaki,M.; Watanabe,K.; Kumagai,A.; 
Itakura,s.; Fukuzumi.Y.; Fujimori, Y. ; Komiyama,M.; 
Tashiro,H.; Tanigami,A.; Fujiwara,T.; Ono,T.; 
Yamada,K.; Fujii,Y.; ozaki,K.; Hirao,M.; Ohmori,Y.; 
Kawabata,A.; Hikiji,T.; Kobatake,N.; Inagaki,H. ; 
lkema,Y.; okamoto,S.; okitani,R.; Kawakami,T.; 
Noguchi,S.; Itoh,T.; shigeta,K.; senba,T.; 
Matsumura,K. ; Nakajima,Y.; Mizuno,T.; Morinaga,M.; 
Sasaki, M.; Togas hi, T. ; Oyama,M.; Hata,H.; Watanabe.M.; 
Komatsu,T.; Mizushima-Sugano, J . ; Satoh,T.; Shirai,Y.; 
Takahashi ,Y. ; Nakagawa,K.; okumura,K. ; Nagase.T.; 
Nomura, N.; Kikuchi,H.; Masuho,Y.; Yamashita,R. ; 
Nakai.K.; Yada,T. ; Nakamura,Y.; Ohara,o.; lsogai,T.; 
Sugano,s. 

Complete sequencing and characterization of 21,243 
full-length human cDNAs 
Nat. Genet., 36 (1), 40-45 (2004) 
(os): CA 140:158332 
2 

lsogai,T.; ota,T. ; Hayashi,K.; sugiyama,T. ; otsuki,T.; 
Suzuki, Y. ; Nishikawa,T. ; Nagai,K.; sugano,s.; 
Shi ratori , A. ; Sudo,H.; Wagatsuma,M. ; Hosoiri,T.; 
Kaku , Y . ; Kodai ra , H . ; Kondo , H . ; sugawara , M . ; 
Takahashi ,M. ; Chiba,Y.; lshida,s.; Murakawa,K.; Ono,Y. 
Takiguchi ,s. ; Watanabe,s.; Kimura,K.; Murakami,K.; 
Ishii,s.; Kawai,Y.; Saito,K.; Yamamoto,J.; 
Wakamatsu.A. ; Nakamura, Y. ; Nagahari,K.; Masuho,Y.; 
Ninomiya,K.; iwayanagi, T. 
NEDO human cDNA sequencing project 
Unpublished 



TITLE (Tl): 

JOURNAL (SO): 
OTHER SOURCE 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) ; 



3 (bases 1 to 1588) 
lsogai,T.; otsuki,T. 
Direct submission 

submitted (23-AUG-2000) Takao Isogai , Helix Research 
Institute, Genomics Laboratory; 1532-3 Yana, Kisarazu, 
Chiba 292-0812, Japan (E-mail :genomics@hri .co .jp, 
Tel =81-438-52-3975, Fax:81-438-52-3986) 



FEATURES (FEAT): 

Feature Key Location 

source 1..1588 



CDS 



65. .1168 



Qual ifier 

/orqanism="Homo sapiens" 

/moT-type="mRNA" 

/db-xref ="taxon : 9606" 

/cl one="HEMBAl003866" 

/tissue-type="whole embryo, mainly 

head" 

/clone-lib="HEMBAl" 

/dev-stage="embryo, 10 weeks" 

/note="cloning vector: pMEl8SFL3" 

/note="unnamed protein product" 

/codon-start=l 

/protei n-i d="BABl3869 . 1" 

/db-xref="GI : 10432890" 

/translation="MKGKHKDECHNFIKVFVPRN 

DEMVFVCGTNAFN PMCRYYRLSTL 

EYDGEEISGLARCPFDARQTNVALFADGKLYSAT 

VAD F LAS DAVI YRSMGDGS A L RT I 

KYDSKWIKEPHFLHAIEYGNYVYFFFREIAVEHN 

N LGKAVYS RVARICKNDMGGSQRV 

LEKHWTS FLKARLNCSVPGDS FFYFDVLQSITDI 

IQINGIPTWGVFTTQLNSIPGSA 

VCAFSMDDIEKVFKGRFKEQKTPDSVWTAVPEDK 

VPKPRPGCCAKHGLAEAYKTSIDF 

PDETLSFIKSHPLMDSAVPPIADEPWFTKTRVRY 

RLTAISVDHSAGPYQNYTVIFVGS 

EAGMVLKVLAKTSPFSLNDSVLLEEIEAYNHAK" 



SEQUENCE (SEQ): 

1 agaagtaata 
61 tgctatgaaa 
121 aaacgatgag 
181 caggttgagt 
241 tgatgccaga 
301 ggctgacttc 
361 tcgcacaata 
421 atatggaaac 
481 caaggctgtg 
541 ggtcctggag 
601 agattcgttt 
661 catccccact 
721 ctgtgcattt 
781 aactccagat 
841 ctgttgtgca 
901 aactctgtca 
961 tgagccctgg 
1021 ttcagccgga 
1081 acttaaagtt 
1141 gattgaagcc 
1201 tcaaaaattt 
1261 tggtttggca 
1321 tttttcccga 
1381 gcccttgttt 
1441 gatatattcc 
1501 aatatacatg 
1561 taataagtaa 



cccaacaaga 
ggcaagcata 
atggtttttg 
accttagaat 
caaaccaatg 
ttggccagcg 
aaatatgatt 
tatgtctatt 
tattcccgcg 
aaacactgga 
ttctactttg 
gtggtcgggg 
agcatggatg 
tctgtttgga 
aaacacggcc 
ttcatcaaat 
ttcacaaaga 
ccctaccaga 
ctggcaaaga 
tacaaccatg 
tcggcatgta 
taatagtgtt 
gcctgctagg 
ttcacctgta 
aagatgctac 
cacatacata 
gagaatattt 



aactgacatg 
aagatgaatg 
tttgtggtac 
atgatgggga 
ttgccctctt 
atgccgttat 
ccaaatggat 
tcttctttcg 
tggcccgcat 
cttcatttct 
atgttctgca 
tgtttaccac 
acattgaaaa 
cagcagttcc 
ttgccgaagc 
ctcatcccct 
ctcgggtcag 
actacacagt 
ccagtccttt 
caaagtaggt 
tttcatctag 
ttgtgttttt 
gcgagggggg 
gaatatgaga 
atgcagcaga 
cacagaatgc 
agaccatt 



gcgatcaaga 
ccacaacttt 
caatgcattc 
agaaattagt 
tgctgatggg 
ttatcgaagc 
aaaagagcca 
agaaatcgct 
atgtaaaaac 
aaaggctcgg 
gtctattaca 
gcagctcaat 
agtattcaaa 
cgaagacaaa 
ttataaaacc 
gatggactct 
gtacagactg 
catctttgtt 
ctctttgaac 
atatgttacg 
tcatgtcctt 
tttctcattg 
tgaatggttg 
acattttaac 
cagctgtgag 
agtactagtt 



caacaggatc 
atcaaagtat 
aatcccatgt 
ggcctggcaa 
aagctgtatt 
atgggtgatg 
cactttcttc 
gtcgaacata 
gacatgggtg 
ctgaactgtt 
gacataatac 
agcatccctg 

ggacggttta 

gtgccaaagc 
tccatcgatt 
gccgttccac 
acggccatct 
ggctctgaag 
gacagcgtat 
agaacgccct 
ttggtcctct 
aaataaatct 
atgagtttaa 
agcacctctc 
cttgcataca 
aagtatttcc 



gagaaaactg 
ttgttccaag 
gtagatacta 
gatgcccatt 
ctgccacagt 
gatctgccct 
atgccataga 
ataatttagg 
gttcccagcg 
ctgtccctgg 
aaatcaatgg 
gttctgctgt 
aggaacagaa 
caaggcctgg 
tcccggatga 
ccattgccga 

cagtggacca 
ctggcatggt 
tactggaaga 
tcagcactgc 
aaattagcag 

tgggtttgtt 

aaataatgca 
ttatcttgca 
cacacacaca 
ttcctatctt 
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LOCUS (LOC) : 

GenBank ACC. NO. (GBN) : 

GenBank version (VER) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 
SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AX026746 GenBank (R) 
AX026746 

AX026746 .1 GI : 10187890 

386088-75-1 

3862 

dna; linear 

Patent 

16 Sep 2000 

Sequence 6 from Patent WO0031252. 
human. 

Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

: 971 a 1111 c 967 g 813 t 



REFERENCE: 

AUTHOR (AU) 
TITLE (TI): 



JOURNAL (SO) 



FEATURES (FEAT) 
Feature Key 



1 (bases 1 to 3862) 

Klostermann.A. ; Behl ,C. 

Human ***semaphorin*** ***s a *** _ 

(sema6a-a), a gene involved in neuronal development and 

regeneration mechanisms during apoptosis, and its use 

as a potential drug target 

Patent: WO 0031252-A 6 02-JUN-2000; KLOSTERMANN ANDREAS 
(DE) ; MAX PLANCK GESELLSCHAFT (DE) ; BEHL CHRISTIAN 
(DE) 



Location 



Qualifier 



source 

CDS 



1..3862 
658. .3750 



/organism="Homo sapiens" 

/db-xref= n taxon:9606" 

/note="unnamed protein product" 

/codon-start=l 

/protei n-i d="CAC09109 . 1" 

/db-xref="Gl : 10187891" 

/transl ation="MRSEALLLYFTLLHFAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYT 

VDIDTSHTEEIYCSKKLTWKSRQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 

TNAFNPSCRNYKMDTLEPFGDEFS 

GMARC P YDAKH AN VA L FADGK L YS ATVTD F LAI D 

AVIYRSLGESPTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVLEKQ\ATTSF 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

V L AT FST P YN S I PGS AVCAYDM L D 

IASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLERYATSNEFPDDTLNFI 

KTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVD 

TAAGPYQNHTWFLGSEKGIILKF 

LARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 

KRIMGMQLDRASSS LYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGAC 

SH LS PNS RLTFEQDIERGNTDGLG 

DCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 

GGMLDWKH L LDS PDSTDPLGAVS S 

HNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILA 

FVMGAVFSGITVYCVCDHRRKDVA 

WQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKD 

PKPEAI LTPLMHNGKLATPGNTAK 

MLIKADQHHLDLTALPTPESTPTLQQKRKPSRGS 

REWERNQN LINACTKDMPPMGS PV 

I PTD L P L RAS PS H I PS WV L P ITQQG YQH E YVDQ 

PKMSEVAQMALEDQAATLEYKTIK 

EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGP 

PGASLSQTGLSKRLEMHHSSSYGV 

DYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 

RNQSFGRGDNPPPAPQRVDSIQVH 

SSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSL 

KPDVPPKPSFAPLSTSMKPNDACT " 



SEQUENCE (SEQ): 

1 ggcacgaggc 
61 cagcagcggc 
121 agcggccagc 
181 cttgcccccc 
241 gcagacacca 
301 gagtttcttg 
361 tttttggctt 
421 tttgcatccc 
481 ggaatgacaa 
541 taatggatta 
601 ccgtcgatgc 
661 aggtcagaag 
721 gaagattctg 
781 gtgggccaca 
841 atgatcatga 
901 gacacatcac 
961 gccgatgtag 
1021 aaagttcttc 
1081 ccttcctgca 



tgcagccaac 
gctcctccgc 
atcaccacac 
tcccccagcc 
ccaaggggac 
aacatttacc 
cttttttcct 
agaaagtagt 
aggcttgcga 
ctaaatggga 
accgaaaagg 
ccttgctgct 
agccaatcag 
agccaggacg 
acggaaccct 
acacggaaga 
acacatgcag 
taaagaaaaa 
gaaactataa 



tccgctcccc 
ggtgccggtc 
ccgcggcacc 
cccaccccgc 
tcaccgaagt 
ctcttccttg 
ctccccttct 
cgccgcgact 
aggagagagc 
tacacgctgt 
gtgaagtaga 
atatttcaca 
tatttcgcat 
gaacaccaca 
ctacattgct 
aatttattgt 
aatgaaggga 
cgatgatgca 
gatggataca 



gcgcactcgg 
gcccgcgatg 
gcgctgccgg 
cccccgccct 
ggaatccaag 
ttggttttct 
ccgctcgtca 
atttccccca 
cgcagccgcg 
accagttcgc 
gaaataaagt 
ctgctacact 
ggcaactata 
cagaggcaca 
gctagggacc 
agcaaaaaac 
aaacataagg 
ttgtttgtct 
ttggaaccat 



ctgcccaggc 
cccgcttagc 
ccgcagagcc 
gaaatgactt 
tggaatttgg 
ttttcttttt 
ttggagatga 
aagagacaag 
gcccggagag 
tccgagcccc 
ctccccgctg 
ttgctggggc 
caaaacagta 
ggctggacat 
atatttatac 
tgacatggaa 
atgagtgcca 
gtggaactaa 
tcggggatga 



gctcggaacc 
agcgtgtagc 
gggccagagc 
gttaatcggc 
atttggagaa 
cttctttttt 
acacatcgcg 
cacacatgta 
atcccctcga 
ggccgcctgt 
aactactatg 
tggtttccca 
tccggtgttt 
ccagatgatt 
tgttgatata 
atctagacag 
caactttatt 
tgccttcaac 
attcagcgga 



1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 



atggccagat 
ctatactcag 
ggagaaagcc 
tttgttcaag 
gagtataaca 
atgggaggat 
aactgctcag 
gtgattcgta 
atccctgggt 
agattcaagg 
cctaagccca 
aatgagttcc 
gtgccctcca 
aaaattgcag 
tcagagaagg 
gacagccttt 
gtcgaagaca 
gcgttctcta 
aaaaaaacct 
tgcagccatt 
acagatggtc 
ctcttgccca 
ggaatgctgg 
gtgtcttccc 
cacgaccagc 
ggggccgtct 
gctgtggtgc 
gtcaccaagc 
atcctcacgc 
ctcattaaag 
ccaacgctgc 
aacctcatca 
gacctgcccc 
cagcagggct 
gcgctggagg 
aagagtccca 
ccacagcggg 
aagcggctgg 
acgaactcgc 
tccaattcct 
gccccgcaga 
actgtctcga 
cgtacgccct 
tccatgaagc 
agcaggcaag 
ccagaccaag 



gcccatatga 
ccacagtgac 
ctaccctgcg 
ccgtggatta 
ccatgggaaa 
ctcaaagagt 
ttcctggaga 
tcaacgggcg 
ctgcagtctg 
aacagaagtc 
ggccaggttg 
ctgatgatac 
tcttcaacag 
tggacacagc 
gaatcatctt 
tcctggagga 
aaaggatcat 
cctgtgtgat 
gtattgcctc 
tatcacccaa 

tgggggactg 
gcacaaccac 
actggaagca 
ataatcacca 
tggttcccgt 
tctcgggcat 
agcgcaagga 
tcagcggcct 
cactcatgca 
cagaccagca 
agcagaagcg 
atgcctgcac 
tgcgggcctc 
accagcatga 
accaggccgc 
accatggggt 
aggcctccct 
aaatgcacca 
tcacgagaag 
ctcacctctc 

gggtggactc 

ggcagcccag 
cgctaaagcc 
ccaatgatgc 
gcgaggtgcc 
aaggcctgcg 



tgccaaacat 
tgacttcctt 
gaccgtcaag 
cggagattat 
ggtagttttc 
cctggagaaa 
ctctcatttt 
tgatgttgtc 
tgcctatgac 
tcctgattcc 
ctgtgctggc 
cctgaacttc 
gccatggttc 
tgctgggcca 
gaagtttttg 
gatgagtgtt 
gggcatgcag 
aaaggttccc 
cagagaccca 
cagcagactg 
tcacaattcc 
atcagattcg 
tctgcttgac 
agacaagaag 
caccctcttg 
caccgtctac 
gaaggagctc 
ctttggggac 
caacggcaag 
ccacctggac 
gaagcccagc 
aaaggacatg 
ccccagccac 
gtacgtggac 
cacactggag 
gaaccttgtg 
gggtcccccg 
ctcctcttcc 
ccaccaggcc 
cagaaaccag 
catccaggtg 
cctcaacgcc 
ggacgtaccc 
gtgtacataa 
cgctcagctc 



gccaacgttg 
gccattgacg 
cacgattcaa 
atctacttct 
ccaagagtgg 
cagtggacgt 
tatttcaaca 
ctggcaacgt 
atgcttgaca 
acctggacac 
tcatcctcct 
atcaagacgc 
ctgagaacaa 
tatcagaatc 
gccagaatag 
tacaactctg 
ctggacagag 
cttggccggt 
tattgtggat 
acttttgagc 
tttgtggcac 
acggctcaag 
tcacctgaca 
ggagtgattc 
gccattgcag 
tgcgtctgtg 
acccactcgc 
actcaatcca 
ctcgccactc 
ctgacggccc 
cgcggcagcc 
ccccccatgg 
atccccagcg 
cagcccaaaa 
tataagacca 
gagaacctgg 
ggagcctccc 
tacggggttg 
accactctca 
agctttggca 
cacagctccc 
tacaactcac 
cccaaaccat 
tcccaggggg 
agcaaggttc 



cactgtttgc 
cagtcattta 
aatggttgaa 
tcttcaggga 
ctcaggtttg 
cgttcctgaa 
ttctccaggc 
tttctacacc 
ttgccagtgt 
cagttcctga 
tagaaagata 
acccgctcat 
tggtcagata 
acactgtggt 
gaaatagtgg 
aaaaatgcag 
caagcagctc 
gtgaacgaca 
ggataaagga 
aggacataga 
tgaatgggca 
aggggtatga 
gcacagaccc 
gggaaagtta 
tcatcctggc 
atcatcggcg 
gccggggctc 
aagacccaaa 
ccggcaacac 
tccccacccc 
gcgagtggga 
gctcccctgt 
tggtggtcct 
tgagcgaggt 
tcaaggaaca 
acagcctgcc 
tgtctcagac 
actataagag 
aaagaaacaa 
ggggagacaa 
agccatctgg 
tgacaaggtc 
cctttgctcc 

agggggtcag 
tcaactgcct 



agatggaaaa 
ccggagtctt 
agaaccatac 
aatagcagtg 
taagaatgat 
ggcgcgcttg 
agttacagat 
ttataacagc 
ttttactggg 
tgaacgagtt 
tgcaacctcc 
ggatgaggca 
ccgccttacc 
ttttctggga 
ttttctaaat 
ctatgatgga 
tctgtatgtt 
tgggaagtgt 
aggtggtgcc 

gcgtggcaat 

ttccagttcc 
gtctagggga 
tttgggggca 
cctcaaaggc 
tttcgtcatg 
caaagacgtg 
catgagcagc 
gccggaggcc 
ggccaagatg 
agagtcaacc 
gaggaaccag 
gattcccacg 
gcccatcacg 
ggcccagatg 
tctcagcagc 
ccccaaagtt 
cggtctaagc 
gagctacccc 
cactaactcc 
cccgccgccc 
ccaggccgtg 
ggggctgaag 
cctttccaca 
gtgtcgaacc 
cgagtaccca 
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LOCUS (LOC): 
GenBank acc. no. (GBN) 
GenBank version (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 
SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 
Gi: 10187888 



JOURNAL (SO) : 



AX026743 
AX026743 
AX026743.1 
390282-37-8 
216 

DNA; linear 

Patent 

16 Sep 2000 

Sequence 3 from Patent WO0031252. 
human. 

Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 
: 45 a 85 c 51 g 35 t 
1 (bases 1 to 216) 
Klostermann,A. ; Behl,c. 

Human ***semaphorin*** ***5 a *** _ ***]*** 
(sema6a-a), a gene involved in neuronal development and 
regeneration mechanisms during apoptosis, and its use 
as a potential drug target 

Patent: WO 0031252-A 3 02-JUN-2000; KLOSTERMANN ANDREAS 
(DE) ; MAX PLANCK GESELLSCHAFT (DE) ; BEHL CHRISTIAN 
(DE) 



FEATURES (FEAT) : 

Feature Key Location Qualifier 



source 
CDS 



1..216 /organism= n Homo sapiens" 

/db-xref= M taxon : 9606" 
<1. ->216 /note="unnamed protein product" 

/codon-start=l 
/protei n-i d="CAC09108 . 1" 
/db-xref ="GI : 10187889" 
/translation="pppAPQRVDSlQVHSSQPSG 

QAVTVSRQPS LNAYNS LTRSGLKR 
TPSLKPDVPPKPSFAPLSTSMKPNDACT" 

SEQUENCE (SEQ): 

1 ccgccgcccg ccccgcagag ggtggactcc atccaggtgc acagctccca gccatctggc 
61 caggccgtga ctgtctcgag gcagcccagc ctcaacgcct acaactcact gacaaggtcg 
121 gggctgaagc gtacgccctc gctaaagccg gacgtacccc ccaaaccatc ctttgctccc 
181 ctttccacat ccatgaagcc caatgatgcg tgtaca 
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LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 
SOURCE : 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 
Gl: 10187886 



JOURNAL (SO): 



FEATURES (FEAT): 
Feature Key 

source 1. 

CDS l. 



AX026741 
AX026741 
AX026741.1 
390282-36-7 
3093 

DNA; linear 

Patent 

16 Sep 2000 

Sequence 1 from Patent WO0031252. 
human. 

Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

: 813 a 855 c 765 g 660 t 
1 (bases 1 to 3093) 
Klostermann,A. ; Behl ,c. 

Human ***semaphorin*** ***5 a *** _ ***]*«« 
(sema6a-a) , a gene involved in neuronal development and 
regeneration mechanisms during apoptosis, and its use 
as a potential drug target 

Patent: WO 0031252-A 1 02-JUN-2000; KLOSTERMANN ANDREAS 
(DE) ; MAX PLANCK GESELLSCHAFT (DE) ; BEHL CHRISTIAN 
(DE) 



Location Qualifier 

■3093 /organism="Homo sapiens" 

/db-xref="taxon:9606" 

■3093 /note="unnamed protein product" 

/codon-start=l 
/protei n-i d="CAC09107 . 1" 
/db-xref="GI : 10187887" 
/transl ation="MRSEALLLYFTLLHFAGAGF 
PEDSEPISISHGNYTKQYPVFVGH 
KPGRNTTQRHRLDIQMIMIMNGTLYIAARDHIYT 
VDIDTSHTEEIYCSKKLTWKSRQA 
DVDTCRMKGKHKDECHNFIKVLLKKNDDALFVCG 
TNAFNPSCRNYKMDTLEPFGDEFS 
GMARCPYDAKHANVALFADGKLYSATVTDFLAID 
AVI YRS LGES PTLRTVKHDSKWLK 
EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 
RVAQVCKNDMGGSQRVLEKQWTS F 
LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 
VL AT FST PYN S I PGS AVCA YDM LD 
IASVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 
CAGSSS LERYATSNEFPDDTLN FI 
KTHPLMDEAVPSIFNRPWFLRTMVRYRLTKIAVD 
TAAGPYQNHTWFLGSEKGIILKF 
LARIGNSGFLNDSLFLEEMSVYNSEKCSYDGVED 
KRIMGMQLDRASSSLYVAFSTCVI 
KVPLGRCERHGKCKKTCIASRDPYCGWIKEGGAC 
SHLSPNSRLTFEQDIERGNTDGLG 
DCHNSFVALNGHSSSLLPSTTTSDSTAQEGYESR 
GGMLDWKHLLDSPDSTDPLGAVSS 
HNHQDKKGVIRESYLKGHDQLVPVTLLAIAVILA 



FVMGAVFSGITVYCVCDHRRKDVA 

WQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKD 

PKPEAILTPLMHNGKLATPGNTAK 

MLIKADQHHLDLTALPTPESTPTLQQKRKPSRGS 

REWERNQN LIN ACTKDM P PMGS PV 

I PTD L P L RAS PS H I PS VWL PITQQGYQH EYVDQ 

PKMSEVAQMALEDQAATLEYKTIK 

EHLSSKSPNHGVNLVENLDSLPPKVPQREASLGP 

PGASLSQTGLSKRLEMHHSSSYGV 

DYKRSYPTNSLTRSHQATTLKRNNTNSSNSSHLS 

RNQS FGRGDN P P PAPQRVDS IQVH 

SSQPSGQAVTVSRQPSLNAYNSLTRSGLKRTPSL 

KPDVPPKPSFAPLSTSMKPNDACT " 

SEQUENCE (SEQ): 

1 atgaggtcag aagccttgct gctatatttc acactgctac actttgctgg ggctggtttc 
61 ccagaagatt ctgagccaat cagtatttcg catggcaact atacaaaaca gtatccggtg 
121 tttgtgggcc acaagccagg acggaacacc acacagaggc acaggctgga catccagatg 
181 attatgatca tgaacggaac cctctacatt gctgctaggg accatattta tactgttgat 
241 atagacacat cacacacgga agaaatttat tgtagcaaaa aactgacatg gaaatctaga 
301 caggccgatg tagacacatg cagaatgaag ggaaaacata aggatgagtg ccacaacttt 
361 attaaagttc ttctaaagaa aaacgatgat gcattgtttg tctgtggaac taatgccttc 
421 aacccttcct gcagaaacta taagatggat acattggaac cattcgggga tgaattcagc 
481 ggaatggcca gatgcccata tgatgccaaa catgccaacg ttgcactgtt tgcagatgga 
541 aaactatact cagccacagt gactgacttc cttgccattg acgcagtcat ttaccggagt 
601 cttggagaaa gccctaccct gcggaccgtc aagcacgatt caaaatggtt gaaagaacca 
661 tactttgttc aagccgtgga ttacggagat tatatctact tcttcttcag ggaaatagca 
721 gtggagtata acaccatggg aaaggtagtt ttcccaagag tggctcaggt ttgtaagaat 
781 gatatgggag gatctcaaag agtcctggag aaacagtgga cgtcgttcct gaaggcgcgc 
841 ttgaactgct cagttcctgg agactctcat ttttatttca acattctcca ggcagttaca 
901 gatgtgattc gtatcaacgg gcgtgatgtt gtcctggcaa cgttttctac accttataac 
961 agcatccctg ggtctgcagt ctgtgcctat gacatgcttg acattgccag tgtttttact 
1021 gggagattca aggaacagaa gtctcctgat tccacctgga caccagttcc tgatgaacga 
1081 gttcctaagc ccaggccagg ttgctgtgct ggctcatcct ccttagaaag atatgcaacc 
1141 tccaatgagt tccctgatga taccctgaac ttcatcaaga cgcacccgct catggatgag 
1201 gcagtgccct ccatcttcaa caggccatgg ttcctgagaa caatggtcag ataccgcctt 
1261 accaaaattg cagtggacac agctgctggg ccatatcaga atcacactgt ggtttttctg 
1321 ggatcagaga agggaatcat cttgaagttt ttggccagaa taggaaatag tggttttcta 
1381 aatgacagcc ttttcctgga ggagatgagt gtttacaact ctgaaaaatg cagctatgat 
1441 ggagtcgaag acaaaaggat catgggcatg cagctggaca gagcaagcag ctctctgtat 
1501 gttgcgttct ctacctgtgt gataaaggtt ccccttggcc ggtgtgaacg acatgggaag 
1561 tgtaaaaaaa cctgtattgc ctccagagac ccatattgtg gatggataaa ggaaggtggt 
1621 gcctgcagcc atttatcacc caacagcaga ctgacttttg agcaggacat agagcgtggc 
1681 aatacagatg gtctggggga ctgtcacaat tcctttgtgg cactgaatgg gcattccagt 
1741 tccctcttgc ccagcacaac cacatcagat tcgacggctc aagaggggta tgagtctagg 
1801 ggaggaatgc tggactggaa gcatctgctt gactcacctg acagcacaga ccctttgggg 
1861 gcagtgtctt cccataatca ccaagacaag aagggagtga ttcgggaaag ttacctcaaa 
1921 ggccacgacc agctggttcc cgtcaccctc ttggccattg cagtcatcct ggctttcgtc 
1981 atgggggccg tcttctcggg catcaccgtc tactgcgtct gtgatcatcg gcgcaaagac 
2041 gtggctgtgg tgcagcgcaa ggagaaggag ctcacccact cgcgccgggg ctccatgagc 
2101 agcgtcacca agctcagcgg cctctttggg gacactcaat ccaaagaccc aaagccggag 
2161 gccatcctca cgccactcat gcacaacggc aagctcgcca ctcccggcaa cacggccaag 
2221 atgctcatta aagcagacca gcaccacctg gacctgacgg ccctccccac cccagagtca 
2281 accccaacgc tgcagcagaa gcggaagccc agccgcggca gccgcgagtg ggagaggaac 
2341 cagaacctca tcaatgcctg cacaaaggac atgcccccca tgggctcccc tgtgattccc 
2401 acggacctgc ccctgcgggc ctcccccagc cacatcccca gcgtggtggt cctgcccatc 
2461 acgcagcagg gctaccagca tgagtacgtg gaccagccca aaatgagcga ggtggcccag 
2521 atggcgctgg aggaccaggc cgccacactg gagtataaga ccatcaagga acatctcagc 
2581 agcaagagtc ccaaccatgg ggtgaacctt gtggagaacc tggacagcct gccccccaaa 
2641 gttccacagc gggaggcctc cctgggtccc ccgggagcct ccctgtctca gaccggtcta 
2701 agcaagcggc tggaaatgca ccactcctct tcctacgggg ttgactataa gaggagctac 
2761 cccacgaact cgctcacgag aagccaccag gccaccactc tcaaaagaaa caacactaac 
2821 tcctccaatt cctctcacct ctccagaaac cagagctttg gcaggggaga caacccgccg 
2881 cccgccccgc agagggtgga ctccatccag gtgcacagct cccagccatc tggccaggcc 
2941 gtgactgtct cgaggcagcc cagcctcaac gcctacaact cactgacaag gtcggggctg 
3001 aagcgtacgc cctcgctaaa gccggacgta ccccccaaac catcctttgc tcccctttcc 
3061 acatccatga agcccaatga tgcgtgtaca taa 
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LOCUS (LOC): AW343314 GenBank (R) 

GenBank ACC. NO. (GBN) : AW343314 

GenBank VERSION (VER) : AW343314.1 GI: 6839680 

CAS REGISTRY NO. (RN) : 254706-97-3 

SEQUENCE LENGTH (SQL): 664 

MOLECULE TYPE (ci) : mRNA; linear 



DIVISION CODE (CI) 
DATE (DATE) : 
DEFINITION (DEF): 



SOURCE : 
ORGANISM (ORGN) : 



mRNA sequence. 



Expressed sequence tag 
31 Jan 2000 

fi76al0.yl Sugano Kawakami zebrafish DRA Danio rerio 
cdna clone 2643546 5' similar to TR:o35464 035464 

***SEMAPHORIN*** *** VIA *** 
zebrafish. 
Danio rerio 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopterygii; Teleostei; 
Ostariophysi ; Cypriniformes ; Cyprinidae; Danio 
NUCLEIC ACID COUNT (NA) : 168 a 148 c 177 g 171 t 
COMMENT: 

Contact: s.L. Johnson 

Washington University school of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel : 314 286 1800 

Fax: 314 286 1810 

Email : est@watson.wustl .edu 

Library constructed by Dr. Sumio Sugano and Dr. Koichi Kawakami DNA 
Sequencing by: Washington University Genome sequencing Center 
Seq primer: T3 ET from Amersham 
High quality sequence stop: 478 



REFERENCE: 

AUTHOR (AU): 



TITLE (TI) : 
JOURNAL (SO): 

FEATURES (FEAT): 
Feature Key 

source 1. .664 



Marra,M. ; 
Gibbons ,M. ; 



1 (bases 1 to 664) 

Sugano, s.; Kawakami, K.; Johnson ,S.; Li , F 
Eddy,S.; Hillier,L.; Clifton, S.; Allen, M 
Jost,s.; Kucaba,T.; Martin, J.; Pape,D.; steptoe,M.; 
Underwood, K. ; Theising.B.; Ritter,E. ; Bowers,Y.; 
Wylie,T.; waterston , R. ; Wilson, R. 
washu Zebrafish EST Project 1999 
Unpublished (1999) 



Locati on Qua! i f i e r 

/organism="Danio rerio" 
/strain="AB" 
/db-xref="taxon:7955 n 
/c1one="2643546" 
/clone-lib="sugano Kawakami 
zebrafish DRA" 

/sex="mixed (one male and one 
female, including unfertilized 
eggs)" 

/dev-stage="adult" 
/lab-host="DH10B (phage 
resistant) 1 ' 

/note="vector: pMEl8s-FL3; site-1: 
Drain (CACTGTGTG); site-2: Drain 
(CACCATGTG) ; 1st strand cDNA was 
primed with an oligo(dT) primer 
[ATGTGGCCTTTTTTTTTTTTTTTTT] ; 
double-stranded cDNA was li gated 
to a Drain adaptor 
[TGTTGGCCTACTGG] , digested and 
cloned into distinct Drain sites 
of the pMEl8S-FL3 vector (5' site 
CACTGTGTG, 3' site CACCATGTG). 
Xhol should be used to isolate the 
cDNA insert, size selection was 
performed to exclude fragments 
<1.5kb. Library constructed by 
Dr. sumio Sugano (University of 
Tokyo institute of Medical 
Science) and kindly donated by Dr. 
Koichi Kawakami. custom primers 
for sequencing: 5 1 end primer 
CTTCTGCTCTAAAAGCTGCG and 3' end 

primer cgacctgcagctcgagcaca. " 

SEQUENCE (SEQ): 

1 tctaaagaaa acacgggaaa tatggaataa gatggtgttc gaagccaaac ggttgacgga 

61 aagaacatgg tgttgctctg tcatcatggc gatggtctta ttagcctggc tcctcccact 

121 cattacttct gccacgcctt ttcctagaga tctgcagcca attagtgtgg tgggattgga 

181 cgactcgtac ctgtacccca gttttcaggg tctggtgtcc agcaatgaga cggagcgtct 

241 gggtctggac tatcagcgca tgatgaggat ccagcacatg ctgtacatcg ccgccagaga 

301 ccatgtgttt gttgtaaatc tcacaacggc agtagatgaa attattccac agcagatcct 



361 gacgtggaga 
421 ttacaattat 
481 aaacgcgttg 
541 ggagctcttg 
601 tgcaggtggg 
661 ctac 



tccacagacg 
atcaaggttc 
aatcctgcct 
ggtcaggcaa 
tatttctatt 



tgtccaagtg 
ttgttcctcg 
gccgcaacta 
gatgtccatt 
cagccacagt 



caccgtcaga 
taatgacggg 
cagattgagt 
tgagtctcga 
gacggactta 



ggaagaaaca 
actctgtttg 
tcactggagc 
cagtccaatg 
caggctagtg 



gtgatgaatg 
cctgtggaac 
aggtcggaca 
gtagagtgtt 
atgctgagat 
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locus (loc) : 
GenBank ACC. NO 
GenBank version 

CAS REGISTRY NO 
SEQUENCE LENGTH 
MOLECULE TYPE 
DIVISION CODE 
DATE (DATE): 
DEFINITION (DEF): 



(GBN) : 
(VER) 
(RN) 
(SQL) 
(CI): 
(CI): 



SOURCE : 
ORGANISM 



(ORGN) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AW280824 GenBank (R) 
AW280824 

AW2 80824.1 GI: 6669373 

252531-29-6 

656 

mRNA; linear 
Expressed sequence tag 
26 Jul 2002 

f;j'46cl2.yl zebrafish adult brain Danio rerio cDNA 
similar to TR:035464 035464 ***SEMAPHORIN*** 

*** VIA *** m - ( mRNA sequence. 
Danio rerio. 
Danio rerio 

Eukaryota; Metazoa; chordata 
Euteleostomi ; Actinopterygii 
Ostariophysi ; cyprini formes 
170 a 162 c 188 g 136 



5' 



Craniata; Vertebrata; 
Neopterygii; Teleostei 
Cyprini dae; Danio 



MO 63108, USA 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Other_ESTs: fi46cl2.xl 

Contact: Stephen L. Johnson 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis 

Tel : 314 286 1800 

Fax: 314 286 1810 

Email : zbrafishOwatson.wustl .edu 

cDNA Library Preparation: John Ngai . cDNA Library Arrayed by: 
Matthew Clark. DNA Sequencing by: Washington University Genome 
Sequencing Center Clone distribution: Genome Systems, St. Louis, 
Missouri (web address: www.genomesystems.com) (email contact: 
info@genomesystems.com) and Research Genetics, Huntsville, Alabama 
(web address: www.resgen.com) (email contact: info@resgen.com) and 
RessourcenZentrumPrimarDatenbank, Berlin, Germany (web address: 
www. rzpd.de) 
Seq primer: T7 

High quality sequence stop: 481. 



REFERENCE: 
AUTHOR 



(AU) 



TITLE (TI): 
JOURNAL (SO) : 

FEATURES (FEAT): 
Feature Key 

source 1. .656 



1 (bases 1 to 656) 
Clark, M.; Johnson, s. L. ; Lehrach,H.; Lee,R.; Li , F . ; 
Marra,M.; Eddy,s.y; Hillier,L.; Kucaba.T.; Martin, J.; 
Beck,C; Wylie.T.; Underwood, K. ; Steptoe,M.; 
Theising,B.; Allen, M.; Bowers, Y. ; Person, B.; 
Swaller,T.; Gibbons, M.; Pape,D.; Harvey, N.; Schurk,R. 
Ritter,E.; Kohn,s.; Shin,T.; Jackson, Y.; Cardenas, M.; 
McCann,R.; Wilson,R. 
WashU zebrafish EST Project 1998 
Unpublished (1998) 



Location 



Qualifier 

/organism="Danio rerio" 
/db-xref="taxon:7955" 
/clone-lib="zebrafish adult brain' 
/sex="mixed male and female" 
/tissue-type="brain" 
/dev-stage="adul t" 
/lab-host="E. coli DH10B" 
/note="vector: pziPLOX; Site-1: 
Notl; site-2: Sail; original 
library was constructed in 
lambdaZIPLOX. Mass excision of 
the cDNA library was performed to 
yield pziPLOX plasmids. Insert 
check was done in original 
library." 



SEQUENCE (SEQ) : 

1 ctgctcgtgt gtgtggagct gaatgagcgc gcattgttcc tttggttgtt tctcctctac 



61 ccgcgcggat 
121 atgtgaggaa 
181 agatgcgagc 
241 tcccagaaga 
301 tgttcgtggg 
361 tgatcgtcat 
421 acatggagac 
481 gacaggcgga 
541 tcatcaaggt 
601 tcaacccgtt 



cgatcgatca gtgatcggat 
ggtgtcaaac acagcggctg 
gcaggccctg ctgctgtacc 
caccgagccc atcagcatcg 
acacaaacca ggcaggaata 
catgaaccgg acgctgtatg 
gtccagcaca gaggagatct 
tgtggacacg tgcaggatga 
gctcctgcag cagagtgagg 
ctgcaaaacc tacaggatgg 



gtgatcggga tttttctgat 
aagttcagaa gagatcgtgc 
tgacgctgct gcagacggct 
cacatggaaa ccacactaaa 
acacacagcg acacaaactg 
tggctgcgag ggatcacatt 
tcttcagtaa gaagctgacc 
agggcaaaca caaggacgag 
actctctgtt tttgtgcggg 
acagtctgga ccctctgggt 



ggatatatcg 
gggacagtaa 
ggagccgctt 
cagtacccag 
gacatccagc 
tacacgctgg 
tggaagtcca 
tgtcataact 
accaacgcct 
gaagaa 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



source: 

ORGANISM (ORGN) : 



AW128036 
AW128036 

AW128036.1 GI: 6115940 
245383-36-2 
802 

mRNA; linear 
Expressed sequence tag 
25 Oct 1999 

fi06al2.yl Sugano Kawakami zebrafish DRA Danio rerio 
cDNA clone 2600350 5' similar to TR:035464 035464 

***SEMAPHORIN*** *** V IA*** . ;, mRNA sequence, 
zebrafish. 
Danio rerio 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopterygii; Teleostei; 
Ostariophysi ; cypriniformes ; Cyprinidae; Danio 
NUCLEIC ACID COUNT (NA) : 202 a 182 c 212 g 202 t 4 others 
COMMENT: 

Other_ESTs: fi06al2.xl 
Contact: S.L. Johnson 

Washington university School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 
Tel : 314 286 1800 
Fax: 314 286 1810 
Email : est@watson.wustl .edu 

Library constructed by Dr. Sumio Sugano and Dr. 
Sequencing by: Washington University Genome sequencing Center 
Seq primer: T3 ET from Amersham 
High quality sequence stop: 488. 



Koichi Kawakami DNA 



REFERENCE: 
AUTHOR 



(AU): 



TITLE (TI): 
JOURNAL (SO) : 

FEATURES (FEAT) : 
Feature Key 



1 (bases 1 to 802) 
sugano, s.; Kawakami, K.; Johnson, S.; Li , F. ; Marra,M.; 
Eddy,s.; Hillier,L.; Clifton,S.; Allen,M.; Gibbons, M. 
Jost,s.; Kucaba,T.; Martin, J.; Pape,D.; Steptoe,M.; 
underwood, K. ; Theising.B.; Ritter.E. ; Bowers,Y.; 
wylie,T.; Waterston.R. ; Wilson,R. 
WashU Zebrafish EST Project 1999 
Unpublished (1999) 



Location 



Qualifier 



source 1..802 /organism="Danio rerio" 

/strain="AB" 
/db-xref="taxon:7955" 
/clone=' , 2600350" 
/clone-lib="Sugano Kawakami 
zebrafish DRA" 

/sex="mixed (one male and one 
female, including unfertilized 
eggs) 1 ' 

/dev-stage-"adul t " 
/lab-host= M DHl0B (phage 
resistant)" 

/note="Vector: pMEl8s-FL3; Site-1: 
Drain (CACTGTGTG); Site-2: Drain 
(CACCATGTG) ; 1st strand cDNA was 
primed with an oligo(dT) primer 

double-stranded cDNA was li gated 
to a Drain adaptor 
[TGTTGGCCTACTGG] , digested and 
cloned into distinct Dram sites 
of the pMEl8s-FL3 vector (5' site 



CACTGTGTG, 3' site CACCATGTG) . 
xhol should be used to isolate the 
cDNA insert. Size selection was 
performed to exclude fragments 
<1.5kb. Library constructed by 
Dr. sumio Sugano (University of 
Tokyo institute of Medical 
Science) and kindly donated by Dr. 
Koichi Kawakami . custom primers 
for sequencing: 5' end primer 
CTTCTGCTCTAAAAGCTGCG and 3 1 end 
primer CGACCTGCAGCTCGAGCACA . " 

SEQUENCE (SEQ) : 

1 tttaaagaaa acacgggaaa tatggaataa gatggtgttc gaagccaaac ggttgacgga 
61 aagaacatgg tgttgctctg tcatcatggc gatggtctta ttagcctggc tcctcccact 
121 cattacttct gccacgcctt ttcctagaga tctgcagcca attagtgtgg tgggattgga 
181 cgactcgtac ctgtacccca gttttcaggg tctggtgtcc agcaatgaga cggagcgtct 
241 gggtctggac tatcagcgca tgatgaggat ccagcacatg ctgtacatcg ccgccagaga 
301 ccatgtgttt gttgtaaatc tcacaacggc agtagatgaa attattccac agcagatcct 
361 gacgtggaga tccacagacg tgtccaagtg caccgtcaga ggaagaaaca gtgatgaatg 
421 ttacaattat atcaaggttc ttgttcctcg taatgacgag actctgtttg cctgtggaac 
481 aaacgcgttg aatcctgcct gccgcaacta cagattgagt tcactggagc aggtcggaca 
541 ggagctcttg ggtcaggcaa gatgtccatt tgagtctcga cagtgcaatg tangagtgtg 
601 tgcacgtggt catttctatt cagccacagt gacggacttc caggcgagtg atgctgtgat 
661 ctacaggagt ntangaggag agggccgacc tgtgctgcgc actgtcaaat accactgcaa 
721 atggctcana gagcctcatt ccctgcacgc tgtcgaatac cgggaactat ggtatttctt 
781 ctccaatgag aatgctgatg ag 
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locus (loc): AI800602 GenBank (R) 

GenBank ACC NO. (GBN) : AI800602 



GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 



AI800602.1 Gl: 5366162 

238193-66-3 

820 



MOLECULE TYPE (ci) : mRNA; linear 
DIVISION code (ci) : Expressed sequence tag 
DATE (DATE): 19 Dec 1999 

DEFINITION (DEF): wgl2dl0.xl soares_NSF_F8_9w_OT_PA_P_Sl Homo sapiens 

cDNA clone IMAGE:2364883 3' similar to TR:o35464 035464 
***semaphorin*** *** VIA *** m . f mRNA sequence. 
SOURCE: human. 
ORGANISM (orgn): Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

NUCLEIC ACID COUNT (NA) : 208 a 170 c 157 g 279 t 6 others 
COMMENT : 

Contact: Robert strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE Consortium (info@image.llnl.gov) for further information. 

Insert Length: 1688 std Error: 0.00 

Seq primer: -40UP from Gibco 

High quality sequence stop: 447. 
REFERENCE: 1 (bases 1 to 820) 

AUTHOR (AU) : NCI-CGAP http://www.ncbi .nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP) , Tumor Gene Index 
JOURNAL (SO): Unpublished (1997) 



FEATURES (FEAT) : 

Feature Key Location 

source 1..820 



Qualifier 

/organism="Homo sapiens" 

/db-xref ="taxon : 9606" 

/clone="lMAGE:2364883" 

/clone-! ib= n soares-NSF-F8-9W-OT-PA 

-P-Sl" 

/lab-host= ,, DHlOB" 
/note="organ: pooled; vector: 
pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site-1: Not 
I; Site-2: Eco ri; Equal amounts 
of plasmid DNA from five 



normalized libraries were mixed, 
and ss circles were made in vitro. 
Following HAP purification, this 
DNA was used as tracer in a 
subtractive hybridization 
reaction. The driver was 
PCR-amplified cDNAs from pools of 
5,000 clones made from the same 5 
libraries. The pools consisted of 
the following libraries and 
clonelDs: soares NbHSF pool 1: 
309384-310919, 323208-325895 
Soares Nb2HP pool 1: 
145032-147335, 147720-148103, 
148872-149255, 15002 - 150407, 
151176-152327 Soares Nb2HF8-9W 
pool 1: 758280-760583, 
772104-774407 Soares NbHPA pool 1: 
304776-306311, 320136-322823, 
326280-326663 Soares NbHOT pool 1: 

723720-726407, 739080-740999 
subtraction by Bento Soares and M. 
Fat i ma Bona! do." 



SEQUENCE (SEQ): 
1 ttttaatatt 
61 tttctttttt 
121 aatcagatgt 
181 aactttacaa 
241 aagcaaaggc 
301 ctcttgggaa 
361 agtagatata 
421 aatcgtgctt 
481 caatggcaag 
541 cgttggcatg 
601 ttncaatgta 
661 caacaatgca 
721 tatgttttcc 
781 tttttttgct 



ttacagtcag 
cccttttttt 
taggggatgt 
caaatataaa 
acactcgggt 
aactaccttt 
atctccgtaa 
gacggtccgc 
gaagtcagtc 
tttggcatca 
tccatcttta 
tcatccgttt 
cttcattctg 
acataaaatt 



gttttattac 
cttttctttt 
ggagatggaa 
tctgagtttg 
ttatggaccc 
cccatggtgt 
tccacggctt 
agggtagggc 
actgtggctg 
tatgggcatc 
tagttctgca 
tctttagaag 
catgtggcta 
tctttccggg 



ttttaagtaa 
ttcttttctt 
ggaaaattgg 
ttgcatctac 
tacccatata 
tatactccac 
gaacaaagta 
tttctccaag 
agtatagttn 
tggccattcc 
ggaagggttg 
aacttttann 
ccatcggcct 
gtgatgtgtt 



taaagagcct 
acaacataca 
tgacatcaca 
cagtgtctag 
attcttacaa 
tgctatttcc 
tggttctttc 
actccggtaa 
tccatctngc 
actgaattca 
angggcatag 
taagtgtggc 
gtctaagatt 



tttccttgct 
ttaagtcgtg 
atatttttac 
caagggtgga 
acctgagcca 
ctgaagaaga 
aaccattttg 
atgactgcgt 
aacagtgcaa 
tccccgaatg 
ttccacagac 
aactcatccc 
tcccatgcag 
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LOCUS (LOC): 
GenBank Acc. no. (gbn) 
GenBank version (ver) : 
CAS registry no. (rn): 
sequence LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (ci) : 
DATE (DATE) : 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (r) 



source: 

organism (orgn) 



AI571889 
AI571889 

AI571889.1 GI:4535263 
229599-17-1 
414 

mRNA; linear 
Expressed sequence tag 
12 May 1999 

to20gl0.xl NCI_CGAP_Ut2 Homo sapiens cDNA clone 
IMAGE:2179650 3' similar to TR:035464 035464 

* * * S EM A PHO R I N * * * *** V IA*** . ; , mRNA sequence 

human. 

Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhi 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 94 a 86 c 89 g 145 t 
COMMENT: 

Contact: Robert strausberg, Ph.D. 
Email : cgapbs-rOmail .nih.gov 

Tissue Procurement: Christopher Moskaluk, M.D., Ph.D., Michael R. 
Emmert-Buck, M.D., Ph.D. 

cdna Library Preparation: Life Technologies, inc. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing center 
clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

www-bi o . 1 1 nl . gov/bbrp/image/image . html 

Insert Length: 978 Std Error: 0.00 

Seq primer: -40UP from Gibco 

POLYA=No. 

REFERENCE: 1 (bases 1 to 414) 

author (au) : nci-cgap http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 



JOURNAL (SO) : 



Project (cgap) , Tumor Gene Index 
Unpublished (1997) 



FEATURES (FEAT) : 

Feature Key Location 

========- === ==-+=====—= ====: 

source 1..414 



Qualifier 



/organism="Homo sapiens" 

/db-xref="taxon:9606" 

/cl one= n IMAGE : 2179650" 

/cl one-1 i b="NCI-CGAP-Ut2 " 

/ti ssue-type="moderatel y-di f f erent 

iated endometrial adenocarcinoma, 

3 pooled tumors" 

/lab-host="DHlOB" 

/note="organ: uterus; Vector: 

PCMV-SPORT6; site-1: Sail; Site-2: 

NotI; Cloned unidi rectionally. 

Primer: oliqo dT. Average insert 

size 1.85 kb. Life Technologies 

catalog #: 11539-012" 



SEQUENCE (SEQ): 

1 tttgttactg attttagaaa cagtattaag caccttgttg ttttgtttct gttttcaagg 

61 aatgcagatt ctactaagat ctccttgcaa aatgaatgca tttaattttc cctatgtgtt 

121 tttcctctgc aggtagtttt cccaagagtg gctcaggttt gtaagaatga tatgggagga 

181 tctcaaagag tcctggagaa acagtggacg tcgttcctga aggcgcgctt gaactgctca 

241 gttcctggag actctcattt ttatttcaac attctccagg cagttacaga tgtgattcgt 

301 atcaacgggc gtgatgttgt cctggcaacg ttttctacac cttataacag gtaatcatgc 

361 cctagctgtg ttgacctcat caattcttcc tggcttcctt cttccagggg gttc 
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LOCUS (LOC): 

GenBank ACC. NO. (GBN) 

GenBank VERSION (ver) 

CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 



SOURCE: 
ORGANISM (ORGN): 



AI384100 GenBank (R) 
AI384100 

AI384100.1 GI: 4196881 
225337-06-4 
547 

mRNA; linear 
Expressed sequence tag 
28 Mar 1999 

te36e07.xl soares_NhHMPu_Sl Homo sapiens cDNA clone 
IMAGE:2088804 3' similar to TR:035464 035464 

***SEMAPHORIN*** *** VIA *** _ ; ( mRNA sequence 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Eutel eostomi ; Mammalia; Eutheria; Primates ; catarrhi 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 131 a 119 c 119 g 178 t 
COMMENT : 

contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE Consortium (info@image.llnl.gov) for further information. 

Insert Length: 639 std Error: 0.00 

Seq primer: -40UP from Gibco 

High quality sequence stop: 461. 
REFERENCE: 1 (bases 1 to 547) 

AUTHOR (AU): NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP), Tumor Gene index 
JOURNAL (so): Unpublished (1997) 



FEATURES (FEAT) 
Feature Key 



Location 



Qua! ifier 



source 1..547 /organism="Homo sapiens" 

/db-xref="taxon:9606" 
/cl one="lMAGE : 2088804" 
/cl one-1 ib="Soares-NhHMPu-sl" 
/tissue-type="Pooled human 
mel anocyte , fetal heart , and 
pregnant uterus" 
/lab-host^'DHlOB" 
/note="organ: mixed (see below); 
Vector: pT7T3D-Pac (Pharmacia) 



with a modified poly! inker; 
site-1: Not I; Site-2: Eco RI; 
Equal amounts of plasmid DNA from 
three normalized libraries 
(melanocyte 2NbHM, pregnant uterus 
NbHPU, and fetal heart NbHHl9W) 
were mixed, and ss circles were 
made in vitro. Following HAP 
purification, this DNA was used as 
tracer in a subtractive 
hybridization reaction. The driver 
was PCR-amplified cDNAs from pools 
of 5,000 clones made from the same 
3 libraries. The pools consisted 
of I.M.A.G.E. clones 
260232-265223, 340488-345479, and 
484488-489479." 



SEQUENCE (SEQ) : 

1 caagactccg 
61 gttttccatc 
121 ttccgctgaa 
181 ggttgaaggc 
241 taataaagtt 
301 cctgtctaga 
361 ctatatcaac 
421 taatcatctg 
481 caaacaccgg 
541 ctgggaa 



gtaaatgact 
tgcaaacagt 
ttcatccccg 
attagttcca 
gtggcactca 
tttccatgtc 
agtataaata 
gatgtccagc 
atactgtttt 



gcgtcaatgg 
gcaacgttgg 
aatggttcca 
cagacaaaca 
tccttatgtt 
agttttttgc 
tggtccctag 
ctgtgcctct 
gtatagttgc 



caaggaagtc 
catgtttggc 
atgtatccat 
atgcatcatc 
ttcccttcat 
tacaataaat 
cagcaatgta 
gtgtggtgtt 
catgcgaaat 



agtcactgtg 
atcatatggg 
cttatagttt 
gtttttcttt 
tctgcatgtg 
ttcttccgtg 
gagggttccg 
ccgtcctggc 
actgattggc 



gctgagtata 
catctggcca 
ctgcaggaag 
agaagaactt 
tctacatcgg 
tgtgatgtgt 
ttcatgatca 
ttgtggccca 
tcagaatctt 
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LOCUS (LOC): 

GenBank acc. no. (gbn) 

GenBank version (VER) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



SOURCE: 
ORGANISM (ORGN) 



AI249189 
AI249189 

AI249189.1 Gl: 3844586 
219387-10-7 
447 

mRNA ; linear 
Expressed sequence tag 
1 Dec 1998 

qh68h06.xl Soares_f etal_l i ver_spleen__lNFLS_Sl Homo 
sapiens cDNA clone IMAGE: 1849883 3' similar to 
TR: 035464 035464 ***SEMAPHORIN*** *** VIA *** 
mRNA sequence, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrh 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 109 a 95 c 89 g 154 t 
COMMENT: 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE Consortium (info@image.llnl.gov) for further information. 

Insert Length: 654 std Error: 0.00 

Seq primer: -40UP from Gibco 

High quality sequence stop: 436. 
REFERENCE: 1 (bases 1 to 447) 

AUTHOR (AU) : nci-cgap http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP) , Tumor Gene Index 
JOURNAL (SO): Unpublished (1997) 



FEATURES (FEAT): 
Feature Key 



Location 
.447 



Qualifier 



source 



/organism="Homo sapiens" 

/db-xref="taxon:9606 M 

/cl one="lMAGE : 1849883" 

/cl one-1 i b="Soares-f etal -1 i ver-spl 

een-lNFLS-Sl" 

/sex="male" 

/dev-stage="20 week-post 
conception fetus" 
/lab-host="DHl0B (ampicillin 



resistant)" 

/note="organ: Liver and spleen; 
vector: pT7T3D (Pharmacia) with a 
modified polylinker; site-1: Pac 
I; Site-2: Eco Rl; This is a 
subtracted version of the original 
Soares fetal liver spleen 1NFLS 
library. 1st strand cDNA was 
primed with a Pac I - oligo(dT) 
primer [5 1 

AACTGGAAGAATTAATTAAAGATCTTTTTTTTTT 
TTTTTTTTT 3'], doubl e-st randed 
cDNA was li gated to Eco Rl 
adaptors (Pharmacia), digested 
with Pac I and cloned into the Pac 
I and Eco Rl sites of the modified 
pT7T3 vector. Library went 
through one round of 
normalization. Library constructed 
by Bento Soares and M.Fatima 
Bona! do. " 



SEQUENCE (SEQ) : 

1 tgttactgat tttagaaaca gtattaagca ccttgttgtt ttgtttctgt tttcaaggaa 

61 tgcagattct actaagatct ccttgcaaaa tgaatgcatt taattttccc tatgtgtttt 

121 tcctctgcag gtagttttcc caagagtggc tcaggtttgt aagaatgata tgggaggatc 

181 tcaaagagtc ctggagaaac agtggacgtc gttcctgaag gcgcgcttga actgctcagt 

241 tcctggagac tctcattttt atttcaacat tctccaggca gttacagatg tgattcgtat 

301 caacgggcgt gatgttgtcc tggcaacgtt ttctacacct tataacaggt aatcatgccc 

361 tagctgtgtt gacctcatca attcttcctg gcttccttct cccaagcgtt ctcttctaat 
421 aaaccatatt tgcaactgac tgaaaat 
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LOCUS (LOC) : 

GenBank acc. no. (gbn) 

GenBank version (ver) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF): 



SOURCE: 
ORGANISM (ORGN); 



AI247563 GenBank (R) 
AI247563 

AI247563.1 Gl: 3842960 
219370-86-2 
359 

mRNA; linear 
Expressed sequence tag 
1 Dec 1998 

qh60e03.xl Soares_fetal_liver_spleen__lNFLS_sl Homo 
sapiens cDNA clone image: 1849084 3' similar to 
TR: 035464 035464 ***SEMAPHORIN*** *** V IA*** . ;, 

mRNA sequence, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) :75a 92 c 79 g 113 t 
comment: 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-rOmail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE consortium (info@image.llnl.gov) for further information. 

Insert Length: 1108 std Error: 0.00 

Seq primer: -40UP from Gibco 

High quality sequence stop: 307. 
REFERENCE: 1 (bases 1 to 359) 

AUTHOR (AU) : nci-cgap http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP) , Tumor Gene Index 
JOURNAL (SO): unpublished (1997) 



FEATURES (FEAT); 
Feature Key 



Location 



Qualifier 



source 



1. .359 



/organism="Homo sapiens" 

/db-xref="taxon:9606" 

/cl one=" IMAGE : 1849084" 

/clone-! i b="Soares-f etal -1 i ver-spl 

een-lNFLS-sl" 

/sex="male" 

/dev-stage="20 week-post 



conception fetus" 
/lab-host="DH10B (ampic-min 
resistant)" 

/note="Organ: Liver and Spleen; 
Vector: pT7T3D (Pharmacia) with a 
modified polyl inker; Site-1: Pac 
I; Site-2: Eco RI; This is a 
subtracted version of the original 
soares fetal liver spleen 1NFLS 
library. 1st strand cDNA was 
primed with a Pac I - oligo(dT) 
primer [5' 

AACTGGAAGAATTAATTAAAGATCTTTTTTTTTT 
TTTTTTTTT 3'], double-stranded 
cDNA was li gated to Eco RI 
adaptors (Pharmacia), digested 
with Pac I and cloned into the Pac 
I and Eco RI sites of the modified 
pT7T3 vector. Library went 
through one round of 
normalization. Library constructed 
by Bento Soares and M.Fatima 
Bona! do. " 



SEQUENCE (SEQ): 

1 atctttctct gttgtatttt tccgtcttgt acactctttc cttatggtga attgccttga 

61 ctgcctttta ttaccaaggg tatttgttgg atcacagaat ctttatttct taagagatat 

121 ttgctgcttc tacttaagtc acaatccttt gtcacaggac aatcatttcc acagcaacca 

181 actgtgatgt cacaattgaa gggaggagga ccctcaaagg ccacgaccag ctggttcccg 

241 tcaccctctt ggccattgca gtcatcctgg ctttcgtcat gggggccgtc ttctcgggca 

301 tcaccgtcta ctgcgtctgt gatcatcggc gcaaagacgt gggaaaactg ggccgcgtg 
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LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF) : 



SOURCE: 
ORGANISM (ORGN) : 



AI240716 GenBank (R) 
AI240716 

AI240716.1 Gl: 3836113 

219272-64-7 

299 

mRNA ; linear 
Expressed sequence tag 
1 Dec 1998 

qh50a02.xl soares_fetal_l iver_spleen„lNFLS_sl Homo 
sapiens cdna clone IMAGE: 1848074 3' similar to 
TR:035464 035464 ***SEMAPHORIN*** *** V IA*** . 

mRNA sequence, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhi 
Hominidae; Homo 
63 a 76 c 59 g 101 t 



NUCLEIC ACID COUNT (NA) 
COMMENT : 

Contact: Robert strausberg, Ph.D. 
Email : cgapbs-rOmail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE consortium (info@image.llnl.gov) for further information. 

Insert Length: 1136 Std Error: 0.00 

Seq primer: -40UP from Gibco 

High quality sequence stop: 239. 
REFERENCE: 1 (bases 1 to 299) 

AUTHOR (AU) : NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (CGAP) , Tumor Gene index 
JOURNAL (SO): Unpublished (1997) 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 1..299 /organism="Homo sapiens" 

/db-xref="taxon:9606" 
/cl one= M IMAGE : 1848074" 
/clone-1 ib="Soares-fetal-liver-spl 
een-lNFLS-sl" 
/sex="male" 

/dev-stage="20 week-post 



SEQUENCE (SEQ) : 

1 atctttctct 
61 ctgcctttta 
121 ttgctgcttc 
181 actgtgatgt 
241 tcaccctctt 



conception fetus" 
/lab-host="DHl0B (ampicillin 
resistant)" 

/note="Organ: Liver and spleen; 
Vector: pT7T3D (Pharmacia) with a 
modified polylinker; Site-1: Pac 
I; site-2: Eco RI; This is a 
subtracted version of the original 
Soares fetal liver spleen 1NFLS 
library. 1st strand cDNA was 
primed with a Pac I - oligo(dT) 
primer [5 1 

AACTGGAAGAATTAATTAAAGATCTTTTTTTTTT 
TTTTTTTTT 3'], doubl e-st randed 
cDNA was li gated to Eco RI 
adaptors (Pharmacia), digested 
with Pac I and cloned into the Pac 
I and Eco RI sites of the modified 
pT7T3 vector. Library went 
through one round of 
normalization. Library constructed 
by Bento Soares and M.Fatima 
Bona! do. " 



gttgtatttt tccgtcttgt acactctttc cttatggtga attgccttga 
ttaccaaggg tatttgttgg atcacagaat ctttatttct taagagatat 
tacttaagtc acaatccttt gtcacaggac aatcatttcc acagcaacca 
cacaattgaa gggaggagga ccctcaaagg ccacgaccag ctggttcccg 
ggccattgca gtcatcctgg ctttcgtcat gggggccgtc ttctcggca 
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LOCUS (LOC) : 

GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



AI122050 GenBank (R) 
AI122050 

AI122050.1 GI:3522374 

214710-75-5 

163 

mRNA; linear 
Expressed sequence tag 
2 Sep 1998 



SOURCE: 

ORGANISM (ORGN): 



NMLMG MUS mUSCUluS CDNA 

to TR:035464 035464 
*** VIA *** m j ( mRNA sequence. 



uc46b03 . rl soares_mammary_gl andj 
clone IMAGE: 1401005 5' similar t< 
***SEMAPHORIN*** 

house mouse. 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
NUCLEIC ACID COUNT (NA) : 27 a 43 c 51 g 42 t 
COMMENT: 

Contact: Marra M/Mouse EST Project 

WashU-HHMI Mouse EST Proiect 

Washington University School of MedicineP 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

Tel: 314 286 1800 

Fax: 314 286 1810 

Email : mouseest@watson .wustl .edu 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
MGI: 912721 

Trace considered overall poor quality 
Possible reversed clone: similarity on wrong strand 
Seq primer: -28ml3 rev2 ET from Amersham 
High quality sequence stop: 1. 



REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 
JOURNAL (SO): 

FEATURES (FEAT): 
Feature Key 



Allen ,M. ; Bowles.M 
; Kucaba,T. ; Lacy,M 



1 (bases 1 to 163) 
Marra, M. ; Hillier, L. 
Dubuque, T. ; Geisel ,S 
Martin, J . ; Morris,M. 
Tan,F. ; underwood, K. 
Lennon,G. ; Soares, B. , 
The Washu-HHMI Mouse EST Project 
Unpublished (1996) 



Dietrich ,N. 
Le,M. ; 

Schellenberq,K. ; steptoe,M.; 
Moore, B.; Tneising,B.; wylie,T.; 
Wilson, R.; Waterston.R. 



Location 



Qualifier 



source 1..163 /organism="Mus musculus" 

/db-xref ="taxon : 10090" 
/cl one=" IMAGE : 140100 5 " 
/cl one-1 i b="soares-mammary-gl and-N 
MLMG" 

/sex="female (lactating)" 
/tissue-type= , 'mammary gland" 
/lab-host=DHlOB" 
/note= n Vector: pT7T3D-Pac 
(Pharmacia) with a modified 
polyl inker; 1st strand cDNA was 
prepared from mammary gland tissue 
from a lactating female, and was 
then primed with a Not I - 
oligo(dT) primer. Double-stranded 
cDNA was li gated to Eco RI 
adaptors (Pharmacia), digested 
with Not I and cloned into the 
Not I and Eco RI sites of the 
modified pT7T3 vector. Library is 
normalized. Library was 
constructed by Bento soares and M. 
Fat i ma Bonaldo. " 



SEQUENCE (SEQ) : 

1 gatcataggg ctggagctgg acactgaggg tcacaggctt tttgtggcct ttcctggatg 
61 tatcgtctac ctctctctca gccggtgtgc ccgcgatgga gcatgtcaga ggagctgcct 
121 ggcttctctg gacccatact gtggatggca tcgatccaga ggc 
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AA939047 GenBank (R) 
AA939047 

AA939047.1 GI: 3098960 
207372-37-0 
487 

mRNA; linear 
Expressed sequence tag 
7 Jul 1998 

op56h03.sl Soares_NFL_T_GBC_Sl Homo sapiens cDNA clone 
IMAGE:1580885 3 f similar to TR:035464 035464 

***SEMAPHORIN*** *** VIA *** _ . f mRNA sequence, 
human . 

Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 124 a 146 c 117 g 100 t 
COMMENT : 

Contact: Robert strausberg, Ph.D. 
Email : cgapbs-rOmail .nih.gov 

This clone is available royalty-free through LLNL ; contact the 

IMAGE Consortium (info@image.llnl.gov) for further information. 

insert Length: 1276 Std Error: 0.00 

Seq primer: -40ml3 fwd. ET from Amersham 

High quality sequence stop: 242. 
REFERENCE: 1 (bases 1 to 487) 

AUTHOR (AU) : NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE (TI) : National Cancer Institute, Cancer Genome Anatomy 

Project (cgap) , Tumor Gene Index 
JOURNAL (SO): Unpublished (1997) 



LOCUS (LOC): 
GenBank ACC. NO. (GBN) : 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (Cl) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF) : 



SOURCE: 
ORGANISM (ORGN) : 



FEATURES (FEAT) : 

Feature Key Location 

====== =====-===+=-—-===-—=: 

source 1..487 



Qualifier 



/organism="Homo sapiens" 
/db-xref="taxon:9606" 
/clone= M IMAGE: 1580885" 
/cl one-1 ib="soares-NFL-T-GBC-sl" 
/lab-host="DHl0B" 
/note="Organ : pooled; Vector: 
pT7T3D-Pac (Pharmacia) with a 
modified polylinker; site-1: Not 
I; site-2: Eco RI; Equal amounts 
of plasmid DNA from three 



normalized libraries (fetal lung 
NbHL19W, testis NHT, and B-cell 
nci-cgap-gcbI) were mixed, and ss 
circles were made in vitro. 
Following HAP purification, this 
DNA was used as tracer in a 
subtractive hybridization 
reaction. The driver was 
PCR-amplified cDNAs from pools of 
5,000 clones made from the same 3 
libraries. The pools consisted of 
I.M.A.G.E. clones 297480-302087, 
682632-687239, 726408-728711, and 
729096-731399. Subtraction by 
Bento soares and M. Fatima 
Bona! do . 



SEQUENCE (SEQ) : 

1 tttcatgcag gacattgatt tatttgttca aaaagtaccc tactgtgtgc caagtactga 

61 gctacatgct gggtatctgg aggatgccga cccttttccg atactcccag ggtattcaga 

121 atgctgctct gctgctgcta caggacacac actcaggtat gtgggctgag caagggaagg 

181 tgaggttggg acaagaggac tcaccataag cagaatcccc agggccagac tgactcccag 

241 tagctccatc ttggcagtca ccatgctcca tggattcctg gttcccagcc tgatccacat 

301 cagtcccacc agatcccctg atatccacac agcccctgga gctatgccat ccacagtatg 

361 ggtcctgaga agccaaacag ctcctctgac aggccccatg ccgggcaaac cgctgtagag 

421 ggaggtagac aatacagcca gaaaaagccc cacaaagcct gtgaccccca gtgtccagct 
481 caagccc 
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LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



GENBANK. RTM. COPYRIGHT 2004 on STN 



GenBank (R) 



Gl: 262 3161 



***semaphorin** 



frifrfcy la* * * 



mRNA, 



AF030430 
AF030430 
AF030430.1 
200046-44-2 
2770 

mRNA; linear 
Rodents 
15 Nov 1997 
Mus musculus 
complete cds 
house mouse. 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
: 720 a 735 c 701 g 614 t 

1 (bases 1 to 2770) 

zhou,L.; White, F. A.; Lentz,s.l.; Wright, D.E.; 
Fisher, D. A.; snider, w.D. 

Cloning and expression of a novel murine semaphorin 
with structural similarity to insect semaphorin I 
Mol . Cell. Neurosci., 9 (1), 26-41 (1997) 
CA 127:106980 

2 (bases 1 to 2770) 
Zhou , L. 

Direct submission 
submitted (17-OCT-1997) Neurology, 
University, 660 S. Euclid Ave., St 
USA 



Washington 
Louis, MO 63110, 



FEATURES (FEAT) : 

Feature Key Location 

source 1. .2770 

CDS 74.. 2 740 



Qualifier 

/organism="Mus musculus" 

/db-xref="taxon : 10090" 

/clone-lib="mEl0 cDNA library" 

/codon-start=l 

/product="semaphorin Via" 

/protei n-i d="AAB86408 . 1" 

/db-xref="GI:2623162" 

/trans! ation="MRPAALLLCLTLLHCAGAGF 

PEDSEPISISHGNYTKQYPVFVGH 

KPGRNTTQRHRLDIQMIMIMNRTLYVAARDHIYT 

VD I DTS HTE E I YCS KK LTWK S RQA 

DVDTCRMKGKHKDECHNFIKVLLKKNDDTLFVCG 

TNAFNPSCRNYRVDTLETFGDEFS 



SEQUENCE (SEQ): 

1 agccgcggcc gcctgcccgt cgatgcaccg 
61 cgctgaacct actatgcggc cagcagcctt 
121 tggggctggt ttcccagaag attccgagcc 
181 acagtatccg gtgtttgtgg gccacaagcc 
241 ggacatccag atgatcatga tcatgaacag 
301 ttatactgtt gatatagaca catcccacac 
361 atggaaatct agacaggctg acgtagacac 
421 atgtcacaac ttcattaaag ttcttctcaa 
481 aaccaatgcc ttcaaccctt cctgcagaaa 
541 ggatgaattt agcggaatgg ccagatgccc 
601 gtttgcagat ggaaaactct actcggctac 
661 catttacagg agcctcggag acagccctac 
721 gttgaaagag ccgtactttg tccaagccgt 
781 cagagaaatt gcagtagaat acaacactat 
841 ggtctgtaag aatgacatgg gagggtctca 
901 cctgaaggct cgcctgaact gctcggtgcc 
961 ccaggcagtt acagatgtga ttcgcattaa 
1021 cacaccttat aacagcatcc caggttctgc 
1081 taatgttttc actgggaggt tcaaggaaca 
1141 tccagacgaa cgagtcccta agcccaggcc 
1201 aaaatatgca acctccaatg agtttcccga 
1261 actcatggac gaggcagtgc cttccatcat 
1321 cagataccgc ctgaccaaaa ttgcagtaga 
1381 tgtggttttc ctgggatcag aaaagggaat 
1441 cagtggtttc ctaaatggca gccttttcct 
1501 gtgcagctat gatggtgtag aagacaaaag 
1561 tggctcactc tatgttgcat tctctacttg 
1621 gcgacatggg aagtgtaaaa aaacctgcat 
1681 aagggaaagt ggttcctgtg cccatctgtc 
1741 cattgagcgt ggcaatacgg acggcctagg 
1801 tgggcacgcc agttccctct atcccagcac 
1861 gtatgagtct aggggaggca tgctggactg 
1921 agaccctttg ggggcagtgt cctctcataa 
1981 aagttacctc aaaagcaacg accagcttgt 
2041 tctggctttt gtcatggggg ccgtcttctc 
2101 ccggcgcaaa gacgtggcag tagtgcagcg 
2161 gggatctatg agcagtgtca ccaagctcag 
2221 cccaaagcct gaggccatcc tcacaccact 
2281 caacaccgcc aagatgctca tcaaggctga 
2341 caccccagag tccaccccga cactgcagga 
2401 gtgggagagg aaccagaaca tcatcaatgc 
2461 ccctgtgatt cccacggacc tgcccctccg 
2521 ggtcctgccc atcacgcagc agggctacca 
2581 cgaggtggtg gctcagatgg cactggagga 
2641 caaagagcac ctgagtagcg agagttcacc 
2701 caacagacaa ggaatcattc tctccgtagc 
2761 gcatggtaac 
STN INTERNATIONAL LOGOFF AT 13:24:29 ON 



GMARCPYDAKHANIALFADGKLYSATVTDFLAID 

AVIYRSLGDSPTLRTVKHDSKWLK 

EPYFVQAVDYGDYIYFFFREIAVEYNTMGKWFP 

RVAQVCKNDMGGSQRVL EKQWTS F 

LKARLNCSVPGDSHFYFNILQAVTDVIRINGRDV 

V L AT FST P YN S I PGS AVCAYDM LD 

IANVFTGRFKEQKSPDSTWTPVPDERVPKPRPGC 

CAGSSSLEKYATSNEFPDDTLNFI 

KTRPLMDEAVPSIINRPWFLRTMVRYRLTKIAVD 

NAAGPYQNHTWFLGSEKGIILKF 

LARIGSSGFLNGSLFLEEMNVYNPEKCSYDGVED 

KRIMGMQLDRASGS LYVAFSTCVI 

KVPLGRCERHGKCKKTCIASRDPYCGWVRESGSC 

AHLSPLSRLTFEQDIERGNTDGLG 

DCHNS FVALNGHASS LYPSTTTSDSAS RDGYESR 

GGM L DWN D L L E A PGSTD P LGAVS S 

HNHQDKKGVIRESYLKSNDQLVPVTLLAIAVILA 

FVMGAVFSGI I VYCVCDH RRKDVA 

WQRKEKELTHSRRGSMSSVTKLSGLFGDTQSKD 

PKPEAILTPLMHNGKLATPSNTAK 

MLIKADQHHLDLTALPTPESTPTLQEKRKPNRGS 

R EWE RNQN I IN ACTKDM P PMGS PV 

I PTD L P L RAS PS H I PS VW L PITQQGYQH EYVDQ 

PKMS EWAQMA L EDQAAT L EYKTI 

KEHLSSESSPYVLKQFSEAFNRQGIILSVAVE" 



aaaagggtga agtagagaag caagtccccc 
actgctgtgt ctcacactgc tacactgcgc 
aatcagtatt tcgcatggca actatacaaa 
aggacggaac accacgcaga ggcacaggct 
aaccctctac gttgctgctc gagaccatat 
agaagaaatt tactgtagca aaaaactgac 
atgcaggatg aaggggaaac ataaggatga 
gaagaatgat gatacgctgt ttgtctgtgg 
ctacagggtc gataccttgg aaacttttgg 
ttatgatgcc aaacatgcca acatcgctct 
agtgactgac tttctggcca ttgatgcagt 
cctcaggact gtcaagcatg attcaaagtg 
ggattatggg gactatatct acttcttctt 
ggggaaggtt gttttcccta gggtggctca 
gagagtcctg gagaagcagt ggacatcttt 
tggagactct catttttatt tcaatatact 
tggccgtgat gttgtcttgg caaccttttc 
agtctgtgcc tatgacatgc ttgacattgc 
gaaatcacct gactctacct ggacacccgt 
a 99 c tgttgt gctggatcat cctctttaga 
tgataccctg aacttcatta agacgcatcc 
caacagacct tggttcctga gaacaatggt 
caacgctgcc gggccatatc agaatcacac 
catcctgaag ttcttggcca ggataggaag 
ggaggagatg aatgtttaca acccagaaaa 
gatcatgggc atgcagctcg acagagcgag 
tgtgatcaag gtgcctcttg gccgctgtga 
cgcctccaga gacccgtatt gtgggtgggt 
accccttagc agactgacat ttgagcagga 
agactgtcac aattccttcg tggcactgaa 
cactacgtca gattcggcat cccgagacgg 
gaacgacctg ctcgaggcac ctggcagcac 
ccaccaggac aagaagggag tgattcggga 
tcctgtcacc ctcctggcca ttgcagtcat 
gggcatcatc gtgtattgtg tgtgcgatca 
caaggagaaa gagctcactc actcgcgtcg 
tggcctcttt ggggacaccc agtccaagga 
catgcacaac ggcaagctgg ccacgcctag 
ccagcatcac ctagacctca ccgccctgcc 
gaaacggaaa cccaaccgcg ggagtcgcga 
ctgcaccaag gacatgcctc ccatgggttc 
ggcctcccca agccacatcc ccagcgtggt 
gcacgagtac gtagatcagc ccaaaatgag 
ccaggctgcc accctggagt ataagaccat 
ctatgttctg aagcaatttt ctgaagcctt 
ggtagaatag ctgctgcaga tggtgtcgct 
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